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Alzheimer's Disease Secretase 

FIELD OF THE INVENTION 

The present invention related to the field of Alzheimer's Disease, APP, amyloid beta 
5 peptide, and human aspartyl proteases as well as a method for the identification of agents that 
modulate the activity of these polypeptides. 
BACKGROUND OF THE INVENTION 

Alzheimer's disease (AD) causes progressive dementia with consequent formation 
15 0 f amyloid plaques, neurofibrillary tangles, gliosis and neuronal loss. The disease occurs in 

10 both genetic and sporadic forms whose clinical course and pathological features are quite 
similar. Three genes have been discovered to date which when mutated cause an autosomal 
dominant form of Alzheimer's disease. These encode the amyloid protein precursor (APP) 
and two related proteins, presenilin-1 (PS 1) and presenilin-2 (PS2), which as their names 
suggest are both structurally and functionally related. Mutations in any of the three enhance 
15 proteolytic processing of APP via an intracellular pathway that produces amyloid beta 
25 peptide or the A£ peptide (or sometimes here as Abeta), a 40-42 amino acid long peptide 

that is the primary component of amyloid plaque in AD. Dysregulation of intracellular 
pathways for proteolytic processing may be central to the pathophysiology of AD. In the 
case of plaque formation, mutations in APP, PS I or PS2 consistently alter the proteolytic 
20 processing of APP so as to enhance formation of A|3 1-42, a form of the A(i peptide which 
seems to be particularly amyloidogenic, and thus very important in AD. Different forms of 
APP range in size from 695-770 amino acids, localize to the cell surface, and have a single 
35 C-terminal transmembrane domain. The Abeta peptide is derived from a region of APP 

adjacent to and containing a portion of the transmembrane domain. Normally, processing 
25 of APP at the a-secretase site cleaves the midregion of the Ap sequence adjacent to the 
membrane and releases the soluble, extracellular domain of APP from the cell surface. 
This a-secretase APP processing, creates soluble APP- a, and it is normal and not thought 
to contribute to AD. 

Pathological processing of APP at the [3- and v-secretase sites produces a very 
3(1 different result than processing at the a site. Sequential processing at the (5- and y-secrctase 
sites releases the AP peptide, a peptide possibly very important in AD pathogenesis. 
Processing at the (3- and "r-secrctase sites can occur in both the endoplasmic reticulum (in 
50 neurons) and in the endosomal/lysosomal pathway after reintcmalization of cell surface 
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APP (in all cells). Despite intense efforts, for 10 years or more, to identify the enzymes 
5 responsible for processing APP at the |3 and y sites, to produce the A0 peptide, those 

proteases remained unknown until this disclosure. Here, for the first time, we report the 
identification and characterization of the |J secretase enzyme. We disclose some known and 
10 5 some novel human aspartic proteases that can act as p-secretase proteases and, for the first 

time, we explain the role these proteases have in AD. We describe regions in the proteases 
critical for their unique function and for the first time characterize their substrate. This is 
the first description of expressed isolated purified active protein of this type, assays that use 
the protein, in addition to the identification and creation of useful cell lines and inhibitors. 
io SUMMARY OF THE INVENTION 

Here we disclose a number of variants of the asp2 gene and peptide. 
20 Any isolated or purified nucleic acid polynucleotide that codes for a protease 

capable of cleaving the beta (0) secretase cleavage site of APP that contains two or more 
sets of special nucleic acids, where the special nucleic acids are separated by nucleic acids 
15 that code for about 100 to 300 amino acid positions, where the amino acids in those 

25 

positions may be any amino acids, where the first set of special nucleic acids consists of the 
nucleic acids that code for the peptide DTG, where the first nucleic acid of the first special 
set of nucleic acids is, the first special nucleic acid, and where the second set of nucleic 

30 ac ids code for either the peptide DSG or DTG, where the last nucleic acid of the second set 

20 of nucleic acids is the last special nucleic acid, with the proviso that the nucleic acids 
disclosed in SEQ ID NO. 1 and SEQ. ID NO. 5 are not included. The nucleic acid 
polynucleotide of claim 1 where the two sets of nucleic acids are separated by nucleic acids 
that code for about 125 to 222 amino acid positions, which may be any amino acids.The 
nucleic acid polynucleotide of claim 2 that code for about 150 to 172 amino acid positions, 
25 which may be any amino acids. The nucleic acid polynucleotide of claim that code for 

40 about 172 amino acid positions, which may be any amino acids. The nucleic acid 

polynucleotide of claim 4 where the nucleotides are described in SEQ. ID. NO. 3 The 
nucleic acid polynucleotide of claim 2 where the two sets of nucleic acids are separated by 
nucleic acids that code for about 150 to 196 amino acid positions. The nucleic acid 

45 

30 polynucleotide of claim 6 where the two sets of nucleotides are separated by nucleic acids 
that code for about 196 amino acids (positions). The nucleic acid polynucleotide of claim 7 
where the two sets of nucleic acids are separated by the same nucleic acid sequences that 
50 separate the same set of special nucleic acids in SEQ. ID. NO. 5. The nucleic acid 
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polynucleotide of claim 4 where the two sets of nucleic acids are separated by nucleic acids 
that code for about 150 to 190, amino acid (positions). The nucleic acid polynucleotide of 
claim 9 where the two sets of nucleotides are separated by nucleic acids that code for about 
190 amino acids (positions). The nucleic acid polynucleotide of claim 10 where the two 

5 sets of nucleotides are separated by the same nucleic acid sequences that separate the same 
set of special nucleotides in SEQ. ED. NO. 1. Claims 1-L 1 where the first nucleic acid of 
the first special set of amino acids, that is, the first special nucleic acid, is operably linked 
to any codon where the nuclic acids of that codon codes for any peptide comprising from 1 
to 10,000 amino acid (positions). The nucleic acid polynucleotide of claims 1-12 where the 

10 first special nucleic acid is operably linked to nucleic acid polymers that code for any 

peptide selected from the group consisting of: any any reporter proteins or proteins which 
facilitate purification/The nucleic acid polynucleotide of claims 1-13 where the first special 
nucleic acid is operably linked to nucleic acid polymers that code for any peptide selected 
from the group consisting of: immunoglobin- heavy chain, maltose binding protein, 

15 glutathion S transfecuon. Green Fluorescent protein, and ubiquitin. Claims 1-14 where the 
last nucleic acid of the second set of special amino acids, that is, the last special nucleic 
acid, is operably linked to nucleic acid polymers that code for any peptide comprising any 
amino acids from 1 to 10,000 amino acids. Claims 1-15 where the last special nucleic acid 
is operably linked to any codon linked to nucleic acid polymers that code for any peptide 

20 selected from the group consisting of: any reporter proteins or proteins which facilitate 

purification. The nucleic acid polynucleotide of claims 1-16 where the first special nucleic 
acid is operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immunoglobin-heavy chain, maltose binding protein, glutathion S 
transfection, Green Fluorescent protein, and ubiquitin. 

25 Any isolated or purified nucleic acid polynucleotide that codes for a protease 

capable of cleaving the beta secretase cleavage site of APP that contains two or more sets of 
special nucleic acids, where the special nucleic acids are separated by nucleic acids that 
code for about 100 to 300 amino acid positions, where the amino acids in those positions 
may be any amino acids, where the first set of special nucleic acids consists of the nucleic 

30 acids that code for DTG, where the first nucleic acid of the first special set of nucleic acids 
is, the first special nucleic acid, and where the second set of nucleic acids code for either 
DSG or DTG, where the last nucleic acid of the second set of special nucleic acids is the 
last special nucleic acid, where the first special nucleic acid is operably linked to nucleic 
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acids that code for any number of amino acids from zero to 81 amino acids and where each 
of those codons may code for any amino acid. The nucleic acid polynucleotide of claim 18 
, where the first special nucleic acid is operably linked to nucleic acids that code for any 
number of from 64 to 77 amino acids where each codon may code for any amino acid. The 

W 5 nucleic acid polynucleotide of claim 19 , where the first special nucleic acid is operably 

linked to nucleic acids that code for 71 amino acids. The nucleic acid polynucleotide of 
claim 20, where the first special nucleic acid is operably linked to 71 amino acids and 
where the first of those 7 1 amino acids is the amino acid T. The nucleic acid 
polynucleotide of claim 21, where the polynucleotide comprises a sequence that is at least 
10 95% identical to SEQ. ID. (Example 1 1). The nucleic acid polynucleotide of claim 22, 
where the complete polynucleotide comprises SEQ. ID. (Example 1 1). The nucleic acid 

20 polynucleotide of claim 18 , where the first special nucleic acid is operably linked to nucleic 

acids that code for any number of from 40 to 54 amino acids where each codon may code 
for any amino acid. The nucleic acid polynucleotide of claim 24, where the first special 

^ 15 nucleic acid is operably linked to nucleic acids that code for 47 amino acids. The nucleic 

acid polynucleotide of claim 20, where the first special nucleic acid is operably linked to 47 
codons where the first those 47 amino acids is the amino acid E. The nucleic acid 
polynucleotide of claim 21, where the polynucleotide comprises a sequence that is at least 

30 95% identical to SEQ. ID. (Example 10). The nucleic acid polynucleotide of claim 22, 

20 where the complete polynucleotide comprises SEQ. ID. (Example 10). 

Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta <P) secretase cleavage site of APP that contains two or more 

35 

sets of special nucleic acids, where the special nucleic acids are separated by nucleic acids 
that code for about 100 to 300 amino acid positions, where the amino acids in those 
25 positions may be any amino acids, where the first set of special nucleic acids consists of the 

40 nucleic acids that code for the peptide DTG, where the first nucleic acid of the first special 

set of amino acids is, the first special nucleic acid, and where the second set of special 
nucleic acids code for either the peptide DSG or DTG, where the last nucleic acid of the 

^ 5 second set of special nucleic acids, the last special nucleic acid, is operably linked to 

30 nucleic acids that code for any number of codons from 50 to 170 codons. The nucleic acid 
polynucleotide of claim 29 where the last special nucleic acid is operably linked to nucleic 
acids comprising from 100 to 170 codons. The nucleic acid polynucleotide of claim 30 

50 where the last special nucleic acid is operably linked to nucleic acids comprising from 142 
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to 163 codons. The nucleic acid polynucleotide of claim 31 where the last special nucleic 

5 

acid is operably linked to nucleic acids comprising about 142 codons. The nucleic acid 
polynucleotide of claim 32 where the polynucleotide comprises a sequence that is at least 
95% identical to SEQ. ID. (Example 9 or 10). The nucleic acid polynucleotide of claim 
10 5 33, where the complete polynucleotide comprises SEQ. ID. (Example 9 or 10). The 

nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is operably 
linked to nucleic acids comprising about 163 codons. The nucleic acid polynucleotide of 
claim 35 where the polynucleotide comprises a sequence that is at least 95% identical to 
SEQ. ID. (Example 9 or 10). The nucleic acid polynucleotide of claim 36, where the 
10 complete polynucleotide comprises SEQ. ID. (Example 9 or 10). The nucleic acid 

polynucleotide of claim 3 1 where the last special nucleic acid is operably linked to nucleic 
20 acids comprising about 170 codons. Claims 1-38 where the second set of special nucleid 

acids code for the peptide DSG, and optionally the first set of nucleic acid polynucleotide is 
operably linked to a peptide purification tag. Claims 1-39 where the nucleic acid 
15 polynucleotide is operably linked to a peptide purification tag which is six histidinc. 

Claims 1-40 where the first set of special nucleic acids are on one polynucleotide and the 
second set of special nucleic acids are on a second polynucleotide, where both first and 
second polynucleotides have at lease 50 codons. Claims 1-40 where the first set of special 
30 nucleic acids are on one polynucleotide and the second set of special nucleic acids are on a 

20 second polynucleotide, where both first and second polynucleotides have at lease 50 codons 
where both said polynucleotides are in the same solution. A vector which contains a 
polynucleotide described in claims 1-42. A cell or cell line which contans a polynucleotide 
described in claims M2. 

Any isolated or purified peptide or protein comprising an amino acid polymer that is 
25 a protease capable of cleaving the beta (p) secretase cleavage site of APP that contains two 
40 or more sets of special amino acids, where the special amino acids are separated by about 

100 to 300 amino acid positions, where each amino acid position can be any amino acid, 
where the first set of special amino acids consists of the peptide DTG, where the first amino 
acid of the first special set of amino acids is, the first special amino acid, where the second 

45 

30 set of amino acids is selected from the peptide comprising either DSG or DTG, where the 
last amino acid of the second set of special amino acids is the last special amino acid, with 
the proviso that the proteases disclosed in SEQ ED NO. 2 and SEQ. ID NO. 6 are not 
50 included. The amino acid polypeptide of claim 45 where the two sets of amino acids are 
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separated by about 125 to 222 amino acid positions where in each position it may be any 
5 amino acid. The amino acid polypeptide of claim 46 where the two sets of amino acids are 

separated by about 150 to 172 amino acids. The amino acid polypeptide of claim 47 where 
the two sets -of amino acids are separated by about 172 amino acids. The amino acid 
10 5 polypeptide of claim 48 where the protease is described in SEQ. ID. NO. 4 The amino acid 

polypeptide of claim 46 where the two sets of amino acids are separated by about 150 to 
1 96 amino acids. The amino acid polypeptide of claim 50 where the two sets of amino 
acids are separated by about 196 amino acids. The amino acid polypeptide of claim 51 
where the two sets of amino acids arc separated by the same amino acid sequences that 
10 separate the same set of special amino acids in SEQ. ID. NO. 6. The amino acid 

polypeptide of claim 46 where the two sets of amino acids are separated by about 150 to 
20 190, amino acids. The amino acid polypeptide of claim 53 where the two sets of 

nucleotides are separated by about 190 amino acids. The amino acid polypeptide of claim 
54 where the two sets of nucleotides are separated by the same amino acid sequences that 
15 separate the same set of special amino acids in SEQ. ID. NO. 2. Claims 45-55 where the 
first amino acid of the first special set of amino acids, that is, the first special amino acid, is 
operably linked to any peptide comprising from 1 to 10,000 amino acids. The amino acid 
polypeptide of claims 45-56 where the first special amino acid is operably linked to any 
30 peptide selected from the group consisting of: any any reporter proteins or proteins which 

20 facilitate purification. The amino acid polypeptide of claims 45-57 where the first special 
amino acid is operably linked to any peptide selected from the group consisting of: 
immunoglobin-heavy chain, maltose binding protein, glutathion S transfection, Green 
35 Fluorescent protein, and ubiquitin. Claims 45-58, where the last amino acid of the second 

set of special amino acids, that is, the last special amino acid, is operably linked to any 
25 peptide comprising any amino acids from 1 to 10,000 amino acids. Claims 45-59 where the 
40 last special amino acid is operably linked any peptide selected from the group consisting of 

any reporter proteins or proteins which facilitate purification. The amino acid polypeptide 
of claims 45-60 where the first special amino acid is operably linked to any peptide selected 
from the group consisting of: immunoglobin-heavy chain, maltose binding protein, 

45 

30 glutathion S transfection, Green Fluorescent protein, and ubiquitin. 

Any isolated or purified peptide or protein comprising an amino acid polypeptide 
that codes for a protease capable of cleaving the beta sccrctasc cleavage site of APP that 
50 contains two or more sets of special amino acids, where the special amino acids arc 
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separated by about 100 to 300 amino acid positions, where each amino acid in each position 

5 

can be any amino acid, where the first set of special amino acids consists of the amino acids 
DTG, where the first amino acid of the first special set of amino acids is, the first special 
amino acid, D, and where the second set of amino acids is either DSG or DTG, where the 
10 5 last amino acid of the second set of special amino acids is die last special amino acid, G, 

where the first special amino acid is operably Linked to amino acids that code for any 
number of amino acids from zero to 81 amino acid positions where in each position it may 
be any amino acid. The amino acid polypeptide of claim 62, where the first special amino 
acid is operably linked to a peptide from about 64 to 77 amino acids positions where each 
in amino acid position may be any amino acid. The amino acid polypeptide of claim 63, 

where the first special amino acid is operably linked to a peptide of 7 1 amino acids. The 
20 amino acid polypeptide of claim 64, where the first special amino acid is operably linked to 

71 amino acids and the first of those 71 amino acids is the amino acid T. The amino acid 
polypeptide of claim 65, where the polypeptide comprises a sequence that is at least 95% 
15 identical to SEQ. ID. (Example 1 1). The amino acid polypeptide of claim 66, where the 
complete polypeptide comprises SEQ. ID. (Example 1 1). The amino acid polypeptide of 
claim 62, where the first special amino acid is operably linked to any number of from 40 to 
54 amino acids (positions) where each amino acid position may be any amino acid. The 
30 amino acid polypeptide of claim 68, where the first special amino acid is operably linked to 

20 amino acids that code for a peptide of 47 amino acids. The amino acid polypeptide of claim 
69, where the first special amino acid is operably linked to a 47 amino acid peptide where 
the first those 47 amino acids is the amino acid E. The amino acid polypeptide of claim 70, 

35 

where the polypeptide comprises a sequence that is at least 95% identical to SEQ. ID. 
(Example 10). The amino acid polypeptide where the polypeptide comprises Example 10). 
25 Any isolated or purified amino acid polypeptide that is a protease capable of 

40 cleaving the beta ((3) secretase cleavage site of APP that contains two or more sets of 

special amino acids, where the special amino acids are separated by about 100 to 300 amino 
acid positions, where each amino acid in each position can be any amino acid, where the 
first set of special amino acids consists of the amino acids that code for DTG, where the 
30 first amino acid of the first special set of amino acids is, the first special amino acid, D, and 
where the second set of amino acids are either DSG or DTG, where the last amino acid of 
the second set of special amino acids is the last special amino acid, G, which is operably 
50 linked to any number of amino acids from 50 to 170 amino acids, which may be any amino 
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acids. The amino acid pol>peptide of claim 73 where the last special amino acid is 
operably linked to a peptide of about 100 to 170 amino acids. The amino acid polypeptide 
of claim 74 where the last special amino acid is operably linked to to a peptide of about 142 
to 163 amino acids. The amino acid polypeptide of claim 75 where the last special amino 
5 acid is operably linked to to a peptide of about about 142 amino acids. The amino acid 
polypeptide of claim 76 where the polypeptide comprises a sequence that is at least 95% 
identical to SEQ. ID. (Example 9 or 10). The amino acid polypeptide of claim 75 where 
the last special amino acid is operably linked to a peptide of about 163 amino acids. The 
amino acid polypeptide of claim 79 where the polypeptide comprises a sequence that is at 
10 least 95% identical to SEQ. ID. (Example 9 or 10). The amino acid polypeptide of claim 
79, where the complete polypeptide comprises SEQ. ID. (Example 9 or 10). The ammo 
20 acid polypeptide of claim 74 where the last special amino acid is operably linked to to a 

peptide of about 170 amino acids. Claim 46-8 1 where the second set of special amino acids 
is comprised of the peptide with the amino acid sequence DSG. Claims 45-82 where the 
15 amino acid polypeptide is operably linked to a peptide purification tag. Claims 45-83 
where the amino acid polypeptide is operably linked to a peptide purification tag which is 
six histidine. Claims 45-84 where the first set of special amino acids are on one 
polypeptide and the second set of special amino acids are on a second polypeptide, where 
both first and second polypeptide have at lease 50 amino acids, which may be any amino 
20 acids. Claims 45-84 where the first set of special amino acids are on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both first and 
second polypeptides have at lease 50 amino acids where both said polypeptides are in the 
same vessel. A vector which contains a polypeptide described in claims 45-86. A cell or 
cell line which contans a polynucleotide described in claims 45-87. The process of making 
25 any of the polynucleotides, vectors, or cells of claims 1-44. The process of making any of 
the polypeptides, vectors or cells of claims 45-88. Any of the polynucleotides, 
polypeptides, vectors, cells or cell lines described in claims 1-88 made from the processes 
described in claims 89 and 90. 

Any isolated or purified peptide or protein comprising an amino acid polypeptide 
30 that codes for a protease capable of cleaving the beta secretase cleavage site of APP that 
contains two or more sets of special amino acids, where the special amino acids are 
separated by about 100 to 300 amino acid positions, where each amino acid in each position 
50 can be any amino acid, where the first set of special amino acids consists of the amino acids 
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DTG, where the first amino acid of the first special set of amino acids is, the first special 

5 

amino acid, D, and where the second set of amino acids is either DSG or DTG, where the 
last amino acid of the second set of special ammo acids is the last special amino acid. G, 
where the "first special amino acid is operably linked to amino acids that code for any 
10 5 number of amino acids from zero to 81 amino acid positions where in each position it may 

be any amino acid. 

The amino acid polypeptide of claim 62, where the first special amino acid is 
operably linked to a peptide from about 30 to 77 amino acids positions where each amino 
acid position may be any amino acid. The amino acid polypeptide of claim 63, where the 
10 first special amino acid is operably linked to a peptide of 35, 47, 7 1 , or 77 amino acids. 

The amino acid polypeptide of claim 63, where the first special amino acid is 
20 operably linked to the same corresponding peptides from SEQ. ID. NO. 3 that are 35, 47, 

71, or 77 peptides in length, beginning counting with the amino acids on the first special 
sequence, DTG, towards the N-terminal of SEQ. ID. NO. 3. 
15 The amino acid polypeptide of claim 65, where the polypeptide comprises a 

sequence that is at least 95% identical to the same corresponding amino acids in SEQ. ID. 
NO. 4, that is, identical to that portion of the sequences in SEQ.ED. NO. 4, including all the 
sequences from both the first and or the second special nucleic acids, toward the N- 
30 terminal, through and including 71, 47, 35 amino acids before the first special amino acids. 

20 (Examples 10 and 11). 

The amino acid polypeptide of claim 65, where the complete polypeptide comprises 
the peptide of 71 amino acids, where the first of the amino acid is T and the second is Q. 
The nucleic acid polynucleotide of claim 21, where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in SEQ. ID. 
25 NO. 3, that is, identical to the sequences in SEQ. ID. NO. 3 including the sequences from 
40 both the first and or the second special nucleic acids, toward the N-Tcrminal, through and 

including 71 amino acids, see Example 10, beginning from the DTG site and including the 
nucleotides from that code for 7 1 amino acids). 

The nucleic acid polynucleotide of claim 22, where the complete polynucleotide 

45 

30 comprises identical to the same corresponding amino acids in SEQ. ID. NO. 3, that is, 
identical to the sequences in SEQ. ID. NO. 3 including the sequences from both the first 
and or the second special nucleic acids, toward the N-Terminal, through and including 7 1 
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amino acids, sec Example 10, beginning from the DTG site and including the nucleotides 
5 from that code for 71 amino acids). 

The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
operably linked to nucleic acids that code for any number of from about 30 to 54 amino 
10 5 acids where each codon may code for any amino acid. 

The nucleic acid polynucleotide of claim 20, where the first special nucleic acid is 
operably linked to 47 codons where the first those 35 or 47 amino acids is the amino acid E 
orG. 

15 The nucleic acid polynucleotide of claim 2 1 , where the polynucleotide comprises a 

10 sequence that is at least 95% identical to the same corresponding amino acids in SEQ. ID. 
NO. 3, that is, identical to that portion of the sequences in SEQ. ID. NO. 3 including the 
2 Q sequences from both the first and or the second special nucleic acids, toward the N- 

Terminal, through and including 35 or 47 amino acids, see Example 1 1 for the 47 example, 
beginning from the DTG site and including the nucleotides from that code for the previous 
15 35 or 47 amino acids before the DTG site). The nucleic acid polynucleotide of claim 22, 
where the polynucleotide comprises identical to the same corresponding amino acids in 
SEQ. ID. NO. 3, that is, identical to the sequences in SEQ. ID. NO. 3 including the 
sequences from both the first and or the second special nucleic acids, toward the N- 
Terminal, through and including 35 or 47 amino acids, see Example 1 1 for the 47 example, 
20 beginning from the DTG site and including the nucleotides from that code for the previous 
35 or 47 amino acids before the DTG site). 

An isolated nucleic acid molecule comprising a polynucleotide, said polynucleotide 
35 encoding a Hu-Asp polypeptide and having a nucleotide sequence at least 95% identical to 

a sequence selected from the group consisting of: 
25 (a) a nucleotide sequence encoding a Hu-Asp polypeptide selected from 

4Q the group consisting of Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b), wherein said Hu-Aspl, 

Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid sequence of SEQ 
ID No. 2, SEQ ID No. 4, and SEQ ID No. 6, respectively; and 

(b) a nucleotide sequence complementary to the nucleotide sequence 

45 30 of (a). 

The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
Asp 1, and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of SEQ 
ID No. 1. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
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Asp2(a), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQ ID No. 4. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is 
Hu-Asp2(b), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQ ID No. 5. An isolated nucleic acid molecule comprising polynucleotide which 
hybridizes under stringent conditions to a polynucleotide having the nucleotide sequence in 
(a) or (b) of claim 92. A vector comprising the nucleic acid molecule of claim 96. The 
vector of claim 97, wherein said nucleic acid molecule is operably linked to a promoter for 
the expression of a Hu-Asp polypeptide. The vector of claim 98, wherein said Hu-Asp 
polypeptide is Hu-Asp 1. The vector of claim 98, wherein said Hu-Asp polypeptide is Hu- 
Asp2(a). The vector of claim 98, wherein said Hu-Asp polypeptide is Hu-Asp2(b). A host 
cell comprising the vector of claim 98. A method of obtaining a Hu-Asp polypeptide 
comprising culturing the host cell of claim 102 and isolating said Hu-Asp polypeptide. An 
isolated Hu-Asp 1 polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence comprising the amino acid sequence of SEQ ID No. 2. An isolated Hu-Asp2(a) 
polypeptide comprising an amino acid sequence at least 95% identical to a sequence 
comprising the amino acid sequence of SEQ ID No. 4. An isolated Hu-Asp2(a) polypeptide 
comprising an amino acid sequence at least 95% identical to a sequence comprising the 
amino acid sequence of SEQ ID No. 8. An isolated antibody that binds specifically to the 
Hu-Asp polypeptide of any of claims 104-107. 

Here we disclose numerous methods to assay the enzyme. 

A method to identify a cell that can be used to screen for inhibitors of p 
secretase activity comprising: 

(a) identifying a cell that expresses a protease capable of cleaving APP at the P 
secretase site, comprising: 

i) collect the cells or the supernantent from the cells to be identified 

ii) measure the production of a critical peptide, where the critical 
peptide is selected from the group consisting of either the APP C- 
terminal peptide or soluble APP, 

iii) select the cells which produce the critical peptide. 

The method of claim 108 where the cells are collected and the critical peptide is the 
APP C-terminai peptide created as a result of the P secretase cleavage. The method of claim 
108 where the supernantent is collected and the critical peptide is soluble APP where the 
soluble APP has a C-terminal created by p secretase cleavage. The method of claim 108 
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where the cells contain any of the nucleic acids or polypeptides of claims 1-86 and where 
5 the cells are shown to cleave the (5 secretase site of any peptide having the following 

peptide structure, P2, PI, Pl\ P2\ where P2 is K or N, where PI is M or L, where PI' is 
D, where P2'- is A. The method of claim 1 1 1 where P2 is K and PI is M.. The method of 
10 5 claim 1 12 where P2 is N and PI is L. 

Any bacterial cell comprising any nucleic acids or peptides in claims 1-86 
and 92-107. A bacterial cell of claim 1 14 where the bacteria is E coll Any eukaryotic cell 
comprising any nucleic acids or polypeptides in claim;; 1-86 and 92-107. 
15 Any insect cell comprising any of the nucleic acids or polypeptides in claims 

10 1-86 and 92-107. A insect cell of claim 1 17 where the insect is srV,orHigh5. Ainsect 
cell of claim 100 where the insect cell is High 5, A mammalian cell comprising any of the 
20 nucleic acids or polypeptides in claims 1-86 and 92-107. A mammalian cell of claim 120 

where the mammalian cell is selected from the group consisting of, human, rodent, 
lagomorph, and primate. A mammalian cell of claim 121 where the mammalian cell is 
15 selected from the group consisting of human cell. A mammalian cell of claim 122 where 
the human cell is selected from the group comprising HEK293, and LMR-32. A 
mammalian cell of claim 121 where the cell is a primate cell. A primate cell of claim 124 
where the primate cell is a COS-7 cell. A mammalian cell of claim 121 where cell is 
selected from a rodent cells. A rodent cell of claim 126 selected from, CHOK1, Ncuro- 
20 2A, 3T3 cells. A yeast cell of claim 1 15. An avian cell of claim 1 15. 

Any isoform of APP where the last two carboxy terminus amino acids of that 
isoform are both lysine residues. In written descrip. Define isoform is any APP 
polypeptide, including APP variants (including mutations), and APP fragments that exists 
in humans such as those desribed in US 5,766,846, col 7, lines 45-67, incorporated into this 
25 document by reference. The isoform of APP from claim 1 14, comprising the isoform 
40 known as APP695 modified so that its last two having two lysine residues as its last two 

carboxy terminus amino acids. The isoform of claim 130 comprising SEQ. ID. 16. The 
isoform variant of claim 130 comprising SEQ. ID. NO. 18, and 20. Any eukaryotic cell 
line, comprising nucleic acids or polypeptides of claim 1 30- 1 32. Any cell line of claim 133 
30 that is a mammaliam cell line (HEK293, Neuro2a, best - plus others. A method for 
identifying inhibitors of an enzyme that cleaves the beta secretase cleavabe site of APP 
comprising: 
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a i culturing cells in a culture medium under conditions in which the enzyme 
causes processing of APP and release of amyloid beta-peptide into the medium and causes 
the accumulation of CTF99 fragments of APP in cell lysates, 

b) - exposing the cultured cells to a test compound; and specifically determining 
whether the test compound inhibits the function of the enzyme by measuring the amount of 
amyloid bcta-peptidc released into the medium and or the amount of CTF99 fragments of 
APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid beta 
peptide present in the culture medium and diminution of CTF99 fragments of APP in cell 
lysates as Asp2 inhibitors. 

The method of claim 135 wherein the cultured cells are a human, rodent or insect 
cell line. The method of claim 136 wherein the human or rodent cell line exhibits p 
secretase activity in which processing of APP occurs with release of amyloid beta-peptide 
into the culture medium and accumulation of CTF99 in cell lysates. A method as in claim 
137 wherein the human or rodent cell line treated with the antisense oligomers directed 
against the enzyme that exhibits p secretase activity, reduces release of soluble amyloid 
beta-peptide into the culture medium and accumulation of CTF99 in cell lysates. A method 
for the identification of an agent that decreases the activity of a Hu-Asp polypeptide 
selected from the group consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method 
comprising: 

a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
determined in the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has decreased the activity of said Hu-Asp 
polypeptide.. The nucleic acids, peptides, proteins, vectors, cells and cell lines, and assays 
described herein. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide that codes for a polypeptide selected from the group consisting of human 
aspartyl proteases, hi particular, human aspartyl protease 1 (Hu-Asp 1) and two alternative 
splice variants of human aspartyl protease 2 (Hu-Asp2), designated herein as Hu-Asp2(a) and 
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Hu-Asp2(b). As used herein, all references to "Hu-Asp" should be understood to refer to all of 
5 Hu-Asp I , Hu-Asp2(a), and Hu-Asp2(b). In addition, as used herein, all references to "Hu- 

Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). Hu-Asp 1 is 

expressed most abundantly in pancreas and prostate tissues, while Hu-Asp2(a) and Hu- 
1Q 5 Asp2(b) are expressed most abundantly in pancreas and brain tissues. The invention also 

provides isolated Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b) polypeptides, as well as fragments 

thereof which exhibit aspartyl protease activity. 

In a preferred embodiment, the nucleic acid molecules comprise a polynucleotide 

15 

having a nucleotide sequence selected from the group consisting of residues 1-1554 of SEQ 
10 ID NO:l, encoding Hu-Aspl, residues 1-1503 of SEQ ID NO:3, encoding Hu-Asp2(a), and 
residues 1-1428 of SEQ ID NO:5, encoding Hu-Asp2(b). In another aspect, the invention 
2Q provides an isolated nucleic acid molecule comprising a polynucleotide which hybridizes 

under stringent conditions to a polynucleotide encoding Hu-Asp 1, Hu-Asp2(a), Hu-Asp-2(b), 
or fragments thereof. European patent application EP 0 848 062 discloses a polypeptide 
15 referred to as "Asp 1," that bears substantial homology to Hu-Aspl, while international 
25 application WO 98/22597 discloses a polypeptide referred to as "Asp 2," that bears substantial 

homology to Hu-Asp2(a), 

The present invention also provides vectors comprising the isolated nucleic acid 
30 molecules of the invention, host cells into which such vectors have been introduced, and 

20 recombinant methods of obtaining a Hu-Aspl, Hu-Asp2(a), or Hu-Asp2(b) polypeptide 
comprising culturing the above-described host cell and isolating the relevant polypeptide. 

In another aspect, the invention provides isolated Hu-Aspl, Hu-Asp2(a), and Hu- 
35 Asp2(b) polypeptides, as well as fragments thereof. In a preferred embodiment, the Hu-Aspl, 

Hu-Asp2(a), and Hu-Asp2(b) polypeptides have the amino acid sequence given in SEQ ID 
25 NO:2, SEQ ID NO:4, or SEQ ID NO:6, respectively. The present invention also desciibes 
40 active forms of Hu-Asp2, methods for preparing such active forms, methods for preparing 

soluble forms, methods for measuring Hu-Asp2 activity, and substrates for Hu-Asp2 cleavage. 
The invention also describes antisense oligomers targeting the Hu-Aspl, Hu-Asp2(a) and Hu- 
Asp2(b) mRNA transcripts and the use of such antisense reagents to decrease such mRNA 
45 30 and consequently the production of the corresponding polypeptide. Isolated antibodies, both 

polyclonal and monoclonal, that binds specifically to any of the Hu-Aspl, Hu-Asp2(a), and 
Hu-Asp2(b) polypeptides of the invention are also provided. 

50 
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The invention also provides a method for the identification of an agent that modulates 
5 the activity of any of Hu-Asp-1, Hu-Asp2(a), and Hu-Asp2(b). The inventions describes 

methods to test such agents in cell-tree assays to which Hu-Asp2 polypeptide is added, as well 
as methods to test such agents in human or other mammalian cells in which Hu-Asp2 is 
5 present. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 

Sequence ID No. 1 — Human Asp-1, nucleotide sequence 
Sequence ID No. 2 — Human Asp-1, predicted amino acid sequence 
15 Sequence CD No. 3 — Human Asp-2(a), nucleotide sequence 

10 Sequence ID No. 4 — Human Asp-2(a), predicted amino acid sequence 
Sequence ID No. 5 — Human Asp-2(b), nucleotide sequence 
Sequence ED No. 6 — Human Asp-2(b), predicted amino acid sequence 
Sequence ED No. 7 — Murine Asp-2(a), nucleotide sequence 
20 Sequence ED No. 8 — Murine Asp-2(a), predicted amino acid sequence 

15 Sequence ID No. 9 — Human APP695, nucleotide sequence 

Sequence ID No. 10 — Human APP695, predicted amino acid sequence 
Sequence ID No.l 1 — Human APP695-Sw, nucleotide sequence 
Sequence ID No.l 2 — Human APP695-Sw. predicted amino acid sequence 
25 Sequence ID No. 1 3 — Human APP695- VF, nucleotide sequence 

20 Sequence ID No. 14 — Human APP695- VF, predicted amino acid sequence 
Sequence ID No. 1 5 — Human APP695-KK, nucleotide sequence 
Sequence ID No. 16— Human APP695-KK t predicted amino acid sequence 
Sequence ID No. 17 — Human APP695-Sw-KK, nucleotide sequence 
Sequence ID No. 18 — Human APP695-Sw-KK, predicted amino acid sequence 
25 Sequence ED No. L9 — Human APP695-VF-KK, nucleotide sequence 

Sequence ID No.20— Human APP695-VF-KK, predicted amino acid sequence 
Sequence ID No.21 — T7-Human-pro-Asp-2(a)ATM, nucleotide sequence 
Sequence ID No.22 — T7-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.23 — T7-Caspase-Human-pro-Asp-2(a)ATM, nucleotide sequence 
30 Sequence ID No. 24 — T7-Caspase-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.25 — Human-pro-Asp-2(a)ATM (low GC), nucleotide sequence 
Sequence ID No.26 — Human-pro-Asp-2(a)ATM, (low GC), amino acid sequence 
Sequence ID No.27 — T7-Caspase-Caspase 8 cleavage-Human-pro-Asp-2(a)ATM, 
nucleotide sequence 

35 Sequence ID No.28 — T7-Caspase Caspase 8 cleavage~Human-pro-Asp-2(a)ATM, amino 
acid sequence 

Sequence ID No.29 — Human Asp-2(a)ATM, nucleotide sequence 
Sequence ED No.30 — Human Asp-2(a)ATM, amino acid sequence 
Sequence ID No.31 — Human Asp-2( a) ATM(His>6 , nucleotide sequence 
40 Sequence ED No.32 — Human Asp-2(a)ATM(His) 6( amino acid sequence 

Sequence ID No.s 33-46 are described below in the Detailed Description of the Invention. 
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Figure 1: Figure 1 shows the nucleotide (SEQ ID NO: I) and predicted amino 
acid sequence (SEQ ID NO:2) of human Aspl. 

Figure 2: Figure 2 shows the nucleotide (SEQ ID NO:3) and predicted amino 
acid sequence (SEQ ID NO:4) of human Asp2(a). 

Figure 3: Figure 3 shows the nucleotide (SEQ ID NO:5) and predicted amino 
acid sequence (SEQ ID NO:6) of human Asp2(b). The predicted transmembrane domain of 
Hu-Asp2(b) is enclosed in brackets. 

Figure 4: Figure 4 shows the nucleotide (SEQ ID No. 1) and predicted amino 
acid sequence (SEQ ID No. 8) of murine Assp2(a) 

Figure 5: Figure 5 shows the BestFit alignment of the predicted amino acid 
sequences of Hu-Asp2(a) and murine Asp2(a) 

Figure 6: Figure 6 shows the nucleotide (SEQ ID No. 21) and predicted amino 
acid sequence (SEQ ID No. 22) of T7-Human-pro-Asp-2(a)ATM 

Figure7: Figure 7 shows the nucleotide (SEQ ID No. 23) and predicted amino 
acid sequence (SEQ ID No. 24) of T7-caspase-Human-pro- Asp-2(a)ATM 

Figure 8: Figure 8 shows the nucleotide (SEQ ID No. 25) and predicted amino 
acid sequence (SEQ ID No. 26) of Human-pro- Asp-2 (a) ATM (low GC) 

Figure 9: Western blot showing reduction of CTF99 production by IIEK 125.3 
cells transfected with antisense oligomers targeting the Hu-Asp2 Mrna 

Figure 10: Western blot showing increase in CTF99 production in mouse 
Neuro-2a cells cotransfected with APP-KK with and without Hu-Asp2 only in those cells 
cotransfected with Hu-Asp2. A further increase in CTF99 production is seen in cells 
cotransfected with APP-Sw-KK with and without Hu-Asp2 only in those cells cotransfected 
with Hu-Asp2 

Figure 1 1: Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 30) 
of Human- Asp2< a)ATM 

Figure 12: Figure 11 shows the predicted amino acid sequence (SEQ ID No. 30) 
of Human- Asp2( a) ATM(His)6 

DETAILED DESCRIPTION OF THE INVENTION 

A few definitions u^ed in this invention follow, most definitions to be used are those 
that would be used by one ordinarily skilled in the art. 

When the p amyloid peptide any peptide resulting from beta secretase cleavage of 
APP. This includes, peptides of 39, 40, 41, 42 and 43 amino acids, extending from the p- 
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secretase cleavage site to 39, 40, 41, 42 and 43 amino acids, p amyloid peptide also means 
5 sequences 1-6, SEQ. ID. NO. 1-6 of US 5,750,349, issued 12 May 1998 (incorporated into 

this document by reference). A p-secretase cleavage fragment disclosed here is called CTF- 
99, which extends from p-secretase cleavage site to the caiboxy terminus of APP. 
10 5 When an isoform of APP is discussed then what is meant is any APP polypeptide, 

including APP variants (including mutations), and APP fragments that exists in humans 
such as those desribed in US 5,766,846, col 7, lines 45-67, incorporated into this document 
by reference and see below. 

The term u p-amyloid precursor protein" (APP) as used herein is defined as a 
10 polypeptide that is encoded by a gene of the same name localized in humans on the long 
arm of chromosome 21 and that includes "PAP - here "p-arnyloid protein" see above, 
20 within its carboxyl third. APP is a glycosylated, single-membrane spanning protein 

expressed in a wide variety of cells in many mammaliam tissues. Examples of specific 
isotypes of APP which are currently known to exist in humans arc the 695-amino acid 
15 polypeptide described by Kang et. al. (1987) Nature 325:733-736 which is designated as the 
"normal" APP; the 751 -amino acid polypeptide described by Ponte et al. (1988) Nature 
331:525-527 (1988) and Tanzi et al. (1988) Nature 331:528-530; and the 770-amino acid 
polypeptide described by Kitaguchi eL al. (1988) Nature 331:530-532. Examples of specific 
30 variants of APP include point mutation which can differ in both position and phenotype (for 

20 review of known variant mutation see Hardy (1992) Nature Genet. 1:233-234). All 

references cited here incorporated by reference. The term "APP fragments" as used herein 
refers to fragments of APP other than those which consist solely of PAP or PAP fragments. 
That is, APP fragments will include amino acid sequences of APP in addition to those 
which form intact 3AP or a fragment of pAP. 
25 When the term "any amino acid 1 ' is used, the amino acids referred to are to be selected 

40 fr om t he following, three letter and single letter abbreviations - which may also be used, are 

provided as follows: 

Alanine, Ala, A; Arginine, Arg, R; Asparagine, Asn, N; Aspartic acid, Asp, D; 
Cystein, Cys, C; Glutamine, Gin, Q; lu;E-Glutamic Acid, Glu, E; Glycine, Gly, G; 

45 

30 Histidine, His, H; Isoleucine, lie, I; Leucine, Leu, L; Lysine, Lys, K; Methionine, Met, 
M; Phenylalanine, Phe, F; Proline, Pro, P; Serine, Ser, S; Threonine, Thr, T; Tryptophan, 
Trp, W; Tyrosine, Tyr, Y; Valine, Val, V; Aspartic acid or Asparagine, Asx, B; Glutamic 
50 acid or Glutamine, Glx, Z; Any amino acid, Xaa, X.. 
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The present invention describes a method to scan gene databases for the simple 

5 active site motif characteristic of aspartyl proteases. Eukaryotic aspartyl proteases such as 

pepsin and renin possess a two-domain structure which folds to bring two aspartyl residues 
into proximity within the active site. These are embedded in the short tripeptide motif 

w 5 DTG, or more rarely, DSG. Most aspartyl proteases occur as proenzyme whose N-terrninus 

must be cleaved for activation. The DTG or DSG active site motif appears at about residue 
65-70 in the proenzyme (prorenin, pepsinogen), but at about residue 25-30 in the active 
enzyme after cleavage of the N-terminal prodomain. The limited length of the active site 

15 motif makes it difficult to search collections of short, expressed sequence tags (EST) for 

10 novel aspartyl proteases. EST sequences typically average 250 nucleotides or less, and so 
would encode 80-90 amino acid residues or less. That would be too short a sequence to 

2Q span the two active site motifs. The preferred method is to scan databases of hypothetical 

or assembled protein coding sequences. The present invention describes a computer 
method to identify candidate aspartyl proteases in protein sequence databases. The method 
15 was used to identify seven candidate aspartyl protease sequences in the Caenorhabditis 

25 

elegans genome. These sequences were then used to identify by homology search Hu-Aspl 
and two alternative splice variants of Hu-Asp2, designated herein as Hu-Asp2(a) and Hu- 
Asp2(b). 

30 In a major aspect of the invention disclosed here we provide new information about 

20 APP processing. Pathogeneic processing of the amyloid precursor protein (APP) via the 
Ap pathway requires the sequential action of two proteases referred to as p-secretase and y- 
secretase. Cleavage of APP by the p-secretase and y-secretase generates the N-terrninus 
and C-terminus of the Ap peptide, respectively. Because over production of the Ap 
peptide, particularly the Api-42, has been implicated in the initiation of Alzheimer's disease, 
25 inhibitors of either the P-secretase and/or the y-secretase have potential in the treatment of 

40 Alzheimer's disease. Despite the importance of the p-secretase and y-secretase in the 

pathogenic processing of APP, molecular definition of these enzymes has not been 
accomplished to date. That is, it was not known what enzymes were required for cleavage 
at either the B-secretase or the y-secretase cleavage site. The sites themselves were 

45 pie 

30 known because APP was known and the ApM 2 , peptide was known, see US 5,766,846 and 
US 5,837,672, (incorporated by reference, with the exception to reference to "soluble" 
peptides). But what enzyme was involved in producing the Api^ peptide was unknown. 
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The present invention involves the molecular definition of several novel human 
aspartyl proteases and one of these, referred to as Hu-Asp-2(a) and Hu-Asp2(b), has been 
characterized in detail. Previous forms of aspl and asp 2 have been disclosed, see EP 
0848062 A2 and EP 0855444A2, inventors David Powel et. uL assigned to Smith Kline 
5 Beecham Corp. (incorporated by reference). Herein are disclosed old and new forms of 
Hu-Asp 2. For the first time they are expressed in active form, their substrates are 
disclosed, and their specificity is disclosed. Prior to this disclosure cell or cell extracts were 
required to cleave the (3-secretase site, now purified protein can be used in assays, also 
described here. Based on the results of (1) antisense knock out experiments, (2) transient 
10 transfection knock in experiments, and (3) biochemical experiments using purified 

recombinant Hu-Asp-2, we demonstrate that Hu-Asp-2 is the P-secretase involved in the 
20 processing of APP. Although the nucleotide and predicted amino acid sequence of Hu- 

Asp-2(a) has been reported, see above, see EP 0848062 A2 and EP 0855444A2, no 
functional characterization of the enzyme was disclosed. Here the authors characterize the 
15 Hu-Asp-2 enzyme and are able to explain why it is a critical and essential enzyme required 
in the formation of AP1-42, peptide and possible a critical step in the development of AD. 

In another embodiment the present invention also describes a novel splice variant of 
Hu-Asp2, referred to as Hu-Asp-2(b). that has never before been disclosed. 
30 In another embodiment, the invention provides isolated nucleic acid molecules 

20 comprising a polynucleotide encoding a polypeptide selected from the group consisting of 
human aspartyl protease 1 (Hu-Aspl) and two alternative splice variants of human aspartyl 
protease 2 (Hu-Asp2), designated herein as Hu-Asp2(a) and Hu-Asp2(b). As used herein, all 
references to "Hu-Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). 
Hu-Aspl is expressed most abundantly in pancreas and prostate tissues, while Hu-Asp2(a) 
25 and Hu-Asp2(b) are expressed most abundantly in pancreas and brain tissues. The invention 
40 also provides isolated Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b) polypeptides, as well as 

fragments thereof which exhibit aspartyl protease activity. 

The predicted amino acid sequences of Hu-Aspl, Hu-Asp2(a) and Hu-Asp2(h) share 
significant homology with previously identified mammalian aspartyl proteases such as 
30 pepsinogen A, pepsinogen B, cathepsin D, cathepsin E, and renin. P.B.Szecs, Scand. J. Clin. 
Lab. Invest. 52:(Suppl. 210 5-22 (1992)). These enzymes are characterized by the presence of 
a duplicated DTG/DSG sequence motif. The Hu-Aspl and HuAsp2 polypeptides disclosed 
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herein also exhibit extremely high homology with the ProSite consensus motif for aspartyl 
proteases extracted from the SwissProt database. 

The nucleotide sequence given as residues 1-1554 of SEQ ID NO:l corresponds to 
the nucleotide sequence encoding Hu-Aspl, the nucleotide sequence given as residues 1-1503 
5 of SEQ ID NO:3 corresponds to the nucleotide sequence encoding Hu-Asp2(a), and the 
nucleotide sequence given as residues 1-1428 of SEQ ID NO:5 corresponds to the nucleotide 
sequence encoding Hu-Asp2(b). The isolation and sequencing of DNA encoding Hu-Aspl, 
Hu-Asp2(a), and Hu-Asp2(b) is described below in Examples 1 and 2. 

As is described in Examples 1 and 2 t automated sequencing methods were used to 
10 obtain the nucleotide sequence of Hu-Aspl, Hu-Asp2(a), and Hu-Asp-2(b). The Hu-Asp 
nucleotide sequences of the present invention were obtained for both DNA strands, and are 
believed to be 100% accurate. However, as is known in the art, nucleotide sequence obtained 
by such automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at least about 
15 99.9% identical to the actual nucleotide sequence of a given nucleic acid molecule. The 
actual sequence may be more precisely determined using manual sequencing methods, which 
are well known in the art. An error in sequence which results in an insertion or deletion of 
one or more nucleotides may result in a frame shift in translation such that the predicted 
amino acid sequence will differ from that which would be predicted from the actual 
20 nucleotide sequence of the nucleic acid molecule, starting at the point of the mutation. The 
Hu-Asp DNA of the present invention includes cDNA, chemically synthesized DNA, DNA 
isolated by PCR, genomic DNA, and combinations thereof. Genomic Hu-Asp DNA may be 
obtained by screening a genomic library with the Hu-Asp2 cDNA described herein, using 
methods that are well known in the art, or with oligonucleotides chosen from the Hu-Asp2 
25 sequence that will prime the polymerase chain reaction (PCR), RNA transcribed from Hu- 
40 Asp DNA is also encompassed by the present invention. 

Due to the degeneracy of the genetic code, two DNA sequences may differ and yet 
encode identical amino acid sequences. The present invention thus provides isolated nucleic 
acid molecules having a polynucleotide sequence encoding any of the Hu-Asp polypeptides of 
30 the invention, wherein said polynucleotide sequence encodes a Hu-Asp polypeptide having 
the complete amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, or 
fragments thereof. 
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Also provided herein are purified Hu-Asp polypeptides, both recombinant and non- 
recombinant. Most importantly, methods to produce Hu-Asp2 polypeptides in active form are 
provided. These include production of Hu-Asp2 polypeptides and variants thereof in bacterial 
cells, insect cells, and mammalian cells, also in forms that allow secretion of the Hu-Asp2 
5 polypeptide from bacterial, insect or mammalian cells into the culture medium, also methods 
to produce variants of Hu-Asp2 polypeptide incorporating amino acid tags that facilitate 
subsequent purification. In a preferred embodiment of the invention the Hu-Asp2 polypeptide 
is converted to a proteolytically active form either in transformed cells or after purification 
and cleavage by a second protease in a cell-free system, such active forms of the Hu-Asp 2 
10 polypeptide beginning with the N-terminal sequence TQHGIR or ETDEEP. Variants and 
derivatives, including fragments, of Hu-Asp proteins having the native amino acid sequences 
given in SEQ ID Nos: 2, 4, and 6 that retain any of the biological activities of Hu-Asp are also 
within the scope of the present invention. Of course, one of ordinary skill in the art will 
readily be able to determine whether a variant, derivative, or fragment of a Hu-Asp protein 
15 displays Hu-Asp activity by subjecting the variant, derivative, or fragment to a standard 
aspartyl protease assay. Fragments of Hu-Asp within the scope of this invention include those 
that contain the active site domain containing the amino acia sequence DTG, fragments that 
contain the active site domain amino acid sequence DSG, fragments containing both the DTG 
and DSG active site sequences, fragments in which the spacing of the DTG and DSG active 
20 site sequences has been lengthened, fragments in which the spacing has been shortened. Also 
within the scope of the invention are fragments of Hu-Asp in which the transmembrane 
domain has been removed to allow production of Hu-Asp2 in a soluble form. In another 
35 embodiment of the invention, the two halves of Hu-Asp2, each containing a single active site 

DTG or DSG sequence can be produced independently as recombinant polypeptides, then 
25 combined in solution where they reconstitute an active protease. 

Hu-Asp variants may be obtained by mutation of native Hu-Asp-encoding nucleotide 
sequences, for example. A Hu-Asp variant, as referred to herein, is a polypeptide 
substantially homologous to a native Hu-Asp polypeptide but which has an amino acid 
sequence different from that of native Hu-Asp because of one or more deletions, insertions, or 
30 substitutions in the amino acid sequence. The variant amino acid or nucleotide sequence is 
preferably at least about 80% identical, more preferably at least about 90% identical, and most 
preferably at least about 95% identical, to a native Hu-Asp sequence. Thus, a variant 
nucleotide sequence which contains, for example, 5 point mutations for every one hundred 
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nucleotides, as compared to a native Hu-Asp gene, will be 95% identical to the native protein. 
The percentage of sequence identity, also termed homology, between a native and a variant 
Hu-Asp sequence may also be determined, for example, by comparing the two sequences 
using any of- the computer programs commonly employed for this purpose, such as the Gap 
program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer 
Group, University Research Park, Madison Wisconsin), which uses the algorithm of Smith 
and Waterman (Adv. AppL Math, 2: 482-489 (1981)). 

Alterations of the native amino acid sequence may be accomplished by any of a 
number of known techniques. For example, mutations may be introduced at particular 
locations by procedures well known to the skilled artisan, such as oligonucleotide-directed 
mutagenesis, which is described by Walder et al (Gene 42:133 (1986)); Bauer et al. (Gene 
37:12 (1985)); Craik (BioTechniques, January 1985, pp. 12-19); Smith et al (Genetic 
Engineering: Principles and Methods , Plenum Press (1981)); and U.S. Patent Nos. 4,518,584 
and 4,737,462. 

Hu-Asp variants within the scope of the invention may comprise conservatively 
substituted sequences, meaning that one or more amino acid residues of a Hu-Asp polypeptide 
are replaced by different residues that do not alter the secondary and/or tertiary structure of the 
Hu-Asp polypeptide. Such substitutions may include the replacement of an amino acid by a 
residue having similar physic ochemical properties, such as substituting one aliphatic residue 
(He, Val, Leu or Ala) for another, or substitution between basic residues Lys and Arg, acidic 
residues Glu and Asp, amide residues Gin and Asn, hydroxyl residues Ser and Tyr, or 
aromatic residues Phe and Tyr. Further information regarding making phcnotypically silent 
amino acid exchanges may be found in Bowie et ai t Science 247:1306-1310 (1990). Other 
Hu-Asp variants which might retain substantially the biological activities of Hu-Asp are those 
where amino acid substitutions have been made in areas outside functional regions of the 
protein, 

In another aspect, the invention provides an isolated nucleic acid molecule comprising 
a polynucleotide which hybridizes under stringent conditions to a portion of the nucleic acid 
molecules described above, e.g., to at least about 15 nucleotides, preferably to at least about 
20 nucleotides, more preferably to at least about 30 nucleotides, and still more preferably to at 
least about from 30 to at least about 100 nucleotides, of one of the previously described 
nucleic acid molecules. Such portions of nucleic acid molecules having the described lengths 
refer to, e.g., at least about 15 contiguous nucleotides of the reference nucleic acid molecule. 
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By stringent hybridization conditions is intended overnight incubation at about 42°C for about 
5 2.5 hours in 6 X SSC/0.1% SDS, followed by washing of the filters in 1.0 X SSC at 65°C, 

0.1% SDS. 

Fragments of the Hu- Asp-encoding nucleic acid molecules described herein, as well as 
10 5 polynucleotides capable of hybridizing to such nucleic acid molecules may be used as a probe 

or as primers in a polymerase chain reaction (PCR). Such probes may be used, e.g., to detect 
the presence of Hu-Asp nucleic acids in in vitro assays, as well as in Southern and northern 
blots. Cell types expressing Hu-Asp may also be identified by the use of such probes. Such 
procedures are well known, and the skilled artisan will be able to choose a probe of a length 
10 suitable to the particular application. For PCR, 5' and 3' primers corresponding to the termini 
of a desired Hu-Asp nucleic acid molecule are employed to isolate and amplify that sequence 
20 using conventional techniques. 

Other useful fragments of the Hu-Asp nucleic acid molecules are antisense or sense 
oligonucleotides comprising a single-stranded nucleic acid sequence capable of binding to a 
15 target Hu-Asp mRNA (using a sense strand), or Hu-Asp DNA (using an antisense strand) 
sequence. In a preferred embodiment of the invention these Hu-Asp antisense 
oligonucleotides reduce Hu-Asp mRNA and consequent production of Hu-Asp polypeptides. 
In another aspect, the invention includes Hu-Asp polypeptides with or without 
3Q associated native pattern glycosylation. Both Hu-Asp 1 and Hu-Asp2 have canonical acceptor 

20 sites for Asn-linked sugars, with Hu-Asp 1 having two of such sites, and Hu-Asp2 having four. 
Hu-Asp expressed in yeast or mammalian expression systems (discussed below) may be 
similar to or significantly different from a native Hu-Asp polypeptide in molecular weight 
35 and glycosylation pattern. Expression of Hu-Asp in bacterial expression systems will provide 

non-glycosylated Hu-Asp. 
25 The polypeptides of the present invention are preferably provided in an isolated form, 

40 and preferably are substantially purified. Hu-Asp polypeptides may be recovered and purified 

from tissues, cultured cells, or recombinant cell cultures by well-known methods, including 
ammonium sulfate or ethanol precipitation, anion or cation exchange chromatography, 
phosphocellulose chromatography, hydrophobic interaction chromatography, affinity 
45 30 chromatography, hydroxylapatite chromatography, lectin chromatography, and high 

performance liquid chromatography (HPLC). In a preferred embodiment, an amino acid tag is 
added to the Flu-Asp polypeptide using genetic engineering techniques that arc well known to 
50 practioners of the art which include addition of six histidine amino acid residues to allow 
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purification by binding to nickel immobilized on a suitable support, epitopes for polyclonal or 
monoclonal antibodies including but not limited to the T7 epitope, the myc epitope, and the 
V5a epitope, and fusion of Hu-Asp2 to suitable protein partners including but not limited to 
glutathione-S- transferase or maltose binding prolien. In a preferred embodiment these 
5 additional amino acid sequences arc added to the C-terminus of Hu-Asp but may be added to 
the N-tenniuus or at intervening positions within the Hu-Asp2 polypeptide. 

The present invention also relates to vectors comprising the polynucleotide molecules 
of the invention, as well as host cell transformed with such vectors. Any of the polynucleotide 
molecules of the invention may be joined to a vector, which generally includes a selectable 
10 marker and an origin of replication, for propagation in a host. Because the invention also 
provides Hu-Asp polypeptides expressed from the polynucleotide molecules described above, 
20 vectors for the expression of Hu-Asp are preferred. The vectors include DNA encoding any of 

the Hu-Asp polypeptides described above or below, operably linked to suitable transcriptional 
or translational regulatory sequences, such as those derived from a mammalian, microbial, 
15 viral, or insect gene. Examples of regulatory sequences include transcriptional promoters, 
operators, or enhancers, mRNA ribosomal binding sites, and appropriate sequences which 
control transcription and translation. Nucleotide sequences are operably linked when the 
regulatory sequence functionally relates to the DNA encoding Hu-Asp. Thus, a promoter 
nucleotide sequence is operably linked to a Hu-Asp DNA sequence if the promoter nucleotide 
20 sequence directs the transcription of the I lu-Asp sequence. 

Selection of suitable vectors to be used for the cloning of polynucleotide molecules 
encoding Hu-Asp, or for the expression of Hu-Asp polypeptides, will of course depend upon 
35 the host cell in which the vector will be transformed, and, where applicable, the host cell from 

which the Hu-Asp polypeptide is to be expressed. Suitable host cells for expression of Hu- 
25 Asp polypeptides include prokaryotes, yeast, and higher eukaryotic cells, each of which is 
4 q discussed below. 

The Hu-Asp polypeptides to be expressed in such host cells may also be fusion 
proteins which include regions from heterologous proteins. Such regions may be included to 
allow, e.g., secretion, unproved stability, or facilitated purification of the polypeptide. For 
45 30 example, a sequence encoding an appropriate signal peptide can be incorporated into 

expression vectors. A DNA sequence for a signal peptide (secretory leader) may be fused 
in frame to the Hu-Asp sequence so that Hu-Asp is translated as a fusion protein comprising 
50 the signal peptide. A signal peptide that is functional in the intended host cell promotes 
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extracellular secretion of the Hu-Asp polypeptide. Preferably, the signal sequence will be 
cleaved from the Hu-Asp polypeptide upon secretion of Hu-Asp from the cell. Non-limiting 
examples of signal sequences that can be used in practicing the invention include the yeast 
l-factor and the honeybee melatin leader in sP9 insect cells. 

In a preferred embodiment, the Hu-Asp polypeptide will be a fusion protein which 
includes a heterologous region used to facilitate purification of the polypeptide. Many of the 
available peptides used for such a function allow selective binding of the fusion protein to a 
binding partner. For example, the Hu-Asp polypeptide may be modified to comprise a 
peptide to form a fusion protein which specifically binds to a binding partner, or peptide tag. 
Non-limiting examples of such peptide tags include the 6-His tag, thioredoxin tag, 
hemaglutinin tag, GST tag, and OmpA signal sequence tag. As will be understood by one of 
skill in the art, the binding partner which recognizes and binds to the peptide may be any 
molecule or compound including metal ions (e.g., metal affinity columns), antibodies, or 
fragments thereof, and any protein or peptide which binds the peptide, such as the FLAG tag. 

Suitable host cells for expression of Hu-Asp polypeptides includes prokaryotes, yeast, 
and higher eukaryotic cells. Suitable prokaryotic hosts to be used for the expression of Hu- 
Asp include bacteria of the genera Escherichia, Bacillus, and Salmonella, as well as members 
of the genera Pscudomonas, Streptomyces, and Staphylococcus. For expression in, e.g., E. 
coli, a Hu-Asp polypeptide may include an N-terminal methionine residue to facilitate 
expression of the recombinant polypeptide in a prokaryotic host. The N-terminai Met may 
optionally then be cleaved from the expressed Hu-Asp polypeptide. Other N-terminal amino 
acid residues can be added to the Hu-Asp polypeptide to facilitate expression in Escherichia 
coli including but not limited to the 17 leader sequence, the T7-caspase 8 leader sequence, as 
well as others leaders including tags for purification such as the 6-His tag (Example 9). Hu- 
Asp polypeptides expressed in E. coli may be shortened by removal of the cytoplasmic tail, 
the transmembrane domain, or the membrane proximal region. Hu-Asp polypeptides 
expressed in E. coli may be obtained in either a soluble form or as an insoluble form which 
may or may not be present as an inclusion body. The insoluble polypeptide may be rendered 
soluble by guanidine HC1, urea or other protein denaturants, then refolded into a soluble form 
before or after purification by dilution or dialysis into a suitable aqueous buffer. If the 
inactive pro form of the Hu-Asp was produced using recombinant methods, it may be rendered 
active by cleaving off the prosegment with a second suitable protease such as human 
immunodeficiency virus protease. 
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Expression vectors for use in prokaryotic hosts generally comprises one or more 
phenotypic selectable marker genes. Such genes generally encode, e.*., a protein that confers 
antibiotic resistance or that supplies an auxotrophic requirement. A wide variety of such 
vectors are readily available from commercial sources. Examples include pSPORT vectors, 

5 pGEM vectors (Promega), pPROEX vectors (LTI, Bethesda, MD), Bluescnpt vectors 
(Stratagenc), pET vectors (Novagen) and pQE vectors (Qiagen). 

Hu-Asp may also be expressed in yeast host cells from genera including 
Saccharomyves, Pichia, and Kluveromyces. Preferred yeast hosts arc S. cerevisiae and P. 
pastoris. Yeast vectors will often contain an origin of replication sequence from a 21 yeast 

10 plasmid, an autonomously replicating sequence (ARS), a promoter region, sequences for 
polyadenylation, sequences for transcription termination, and a selectable marker gene. 
Vectors replicable in both yeast and E. coli (termed shuttle vectors) may also be used. In 
addition to the above-mentioned features of yeast vectors, a shuttle vector will also include 
sequences for replication and selection in E. coli. Direct secretion of Hu-Asp polypeptides 

15 expressed in yeast hosts may be accomplished by the inclusion of nucleotide sequence 
encoding the yeast I-factor leader sequence at the 5' end of the Hu-Asp-encoding nucleotide 
sequence. 

Insect host cell culture systems may also be used for the expression of Hu-Asp 
polypeptides. In a preferred embodiment, the Hu-Asp polypeptides of the invention are 
20 expressed using an insect cell expression system (see Example 10). Additionally, a 
baculovirus expression system can be used for expression in insect cells as reviewed by 
Luckow and Summers, Bio/Technology 6:47 (1988). 

In another preferred embodiment, the Hu-Asp polypeptide is expressed in mammalian 
host cells. Non-limiting examples of suitable mammalian cell lines include the COS-7 line of 
25 monkey kidney cells (Gluzman et al, Cell 23:175 (1981)). human embyonic kidney cell line 
293, and Chinese hamster ovary (CHO) cells. Preferably, Chinese hamster ovary (CHO) cells 
are used for expression of Hu-Asp proteins (Example 11). 

The choice of a suitable expression vector for expression of the Hu-Asp polypeptides 
of the invention will of course depend upon the specific mammalian host cell to be used, and 
30 is within the skill of the ordinary artisan. Examples of suitable expression vectors include 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). A preferred vector for expression of 
Hu-Asp polypeptides is pcDNA3.1-Hygro (Invitrogen). Expression vectors for use in 
mammalian host cells may include transcriptional and transnational control sequences derived 
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from viral genomes. Commonly used promoter sequences and enhancer sequences wliich 
may be used in the present invention include, but are not limited to, those derived from human 
cytomegalovirus (CMV), Adenovirus 2, Polyoma virus, and Simian virus 40 (SV40). 
Methods for -the construction of mammalian expression vectors are disclosed, for example, in 
5 Okayama and Berg {Uol Cell. Biol. 3\2%0 (1983)); Cosman et al (Mol Immunol 2i:935 
(1986)); Cosman et al {Nature 312:16% (1984)); EP-A-0367566; and WO 91/18982. 

The polypeptides of the present invention may also be used to raise polyclonal and 
monoclonal antibodies, which are useful in diagnostic assays for detecting Hu-Asp 
polypeptide expression. Such antibodies may be prepared by conventional techniques. See, 
10 for example, Antibodies: A Laboratory Manual, Harlow and Land (eds.), Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N.Y., (1988); Monoclonal Antibodies, Hybridomas: A 
New Dimension in Biological Analyses, Kcnnct et al (cds.), Plenum Press, New York (19S0). 
Synthetic peptides comprising portions of Hu-Asp containing 5 to 20 amino acids may also be 
used for the production of polyclonal or monoclonal antibodies after linkage to a suitable 
15 carrier protein including but not limited to keyhole limpet hemacyanin (KLH), chicken 
ovalbumin, or bovine serum albumin using various cross-linking reagents including 
carbodimides, glutaraldehyde, or if the peptide contains a cysteine, N-methylmaleimide. A 
preferred peptide for immunization when conjugated to KLH contains the C-terminus ot 
Hu_Aspl or Hu-Asp2 comprising QRRPRDPEVVNDESSLVRHRWK or 
20 LRQQHDDFADDISLLK. respectively. 

The Hu-Asp nucleic acid molecules of the present invention are also valuable for 
chromosome identification, as they can hybridize with a specific location on a human 
chromosome. Hu-Asp 1 has been localized to chromosome 21. while Hu-Asp2 has been 
localized to chromosome 1 lq23.3-24.1. There is a current need for identifying particular sites 
25 on the chromosome, as few chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for marking chromosomal location. Once a 
sequence has been mapped to a precise chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic map data. The relationship 
between genes and diseases that have been mapped to the same chromosomal region can then 
30 be identified through linkage analysis, wherein the coinheritance of physically adjacent genes 
is determined. Whether a gene appearing to be related to a particular disease is in fact the 
cause ot" the disease can then be determined by comparing the nucleic acid sequence between 
50 affected and unaffected individuals. 

27 



55 



10 



15 



20 



25 



30 



35 



40 



45 



WO 00/17369 



PCT/US99/2088I 



5 In another embodiment, the invention relates to a method of assaying Hu-Asp function, 

specilically Hu-Asp2 function wruch involves incubating in solution the Hu-Asp polypeptide 
with a suitable substrate including but not limited to a synthetic peptide containing the p- 
w 5 secretase cleavage site of APP, preferably one containing the mutation found in a Swedish 

kindred with inherited AD in which KM is changed to ML, such peptide comprising the 
sequence SEVNLDAEFR in an acidic buffering solution, preferably an acidic buffering 
solution of pH5.5 (see Example 12) using cleavage of the peptide monitored by high 
15 performance liquid chromatography as a measure of Hu-Asp proteolytic activity. Preferred 

10 assays for proteolytic activity utilize internally quenched peptide assay substrates. Such 
suitable substrates include peptides which have attached a paired flurophore and quencher 
20 including but not limited to coumarin and dinitrophenol, respectively, such that cleavage of 

the peptide by the Hu-Asp results in increased fluorescence due to physical separation of the 
flurophore and quencher. Preferred colorimetric assays of Hu-Asp proteolytic activity utilize 
15 other suitable substrates that include the P2 and PI amino acids comprising the recognition 
25 site for cleavage linked to o-nitrophenol through an amide linkage, such that cleavage by the 

Hu-Asp results in an increase in optical density after altering the assay buffer to alkaline pH. 

In another embodiment, the invention relates to a method for the identification of an 
30 agent that increases the activity of a Hu-Asp polypeptide selected from the group consisting of 

20 Hu-Asp 1 , Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 
35 (b) comparing the activity of said Hu-Asp polypeptide determined in the 

presence of said test agent to the activity of said Hu-Asp polypeptide 
25 determined in the absence of said test agent; 

40 whereby a higher level of activity in the presence of said test agent than in the absence of said 

test agent indicates that said test agent has increased the activity of said Hu-Asp polypeptide. 
Such tests can be performed with Hu-Asp polypeptide in a cell free system and with cultured 
cells that express Hu-Asp as well as variants or isoforms thereof. 
45 30 In another embodiment, the invention relates to a method for the identification of an 

agent that decreases the activity of a Hu-Asp polypeptide selected from the group consisting 
of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

50 
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(a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
- presence of said test agent to the activity of said Hu-Asp polypeptide 

determined in the absence of said test agent; 
whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has decreased the activity of said Hu-Asp polypeptide. 
Such tests can be performed with Hu-Asp polypeptide in a cell free system and with cultured 
cells that express Hu-Asp as well as variants or isoforms thereof. 

In another embodiment, the invention relates to a novel cell line (HEK 125.3 cells) 
for measuring processing of amyloid (3 peptide (Ap) from the amyloid protein precursor 
(APP). The cells are stable transformanls of human embryonic kidney 293 cells (HEK293) 
with a bicistronic vector derived from pIRES-EGFP (Clontech) containing a modified 
human APP cDNA. an internal ribosome entry site and an enhanced green fluorescent 
protein (EGFP) cDNA in the second cistron. The APP cDNA was modified hy adding two 
lysine codons to the carboxyl terminus of the APP coding sequence. This increases 
processing of Ap peptide from human APP by 2-4 fold. This level of A0 peptide 
processing is 60 fold higher than is seen in nontransformed HEK293 cells. HEK 125.3 cells 
will be useful for assays of compounds that inhibit Ap peptide processing. This invention 
also includes addition of two lysine residues to the C-terminus of other APP isoforms 
including the 751 and 770 amino acid isoforms, to isoforms of APP having mutations found 
in human AD including the Swedish KM-»NL and V717^F mutations, to C-terminal 
fragments of APP, such as those beginning with the p-sccretase cleavage site, to C-terminal 
fragments of APP containing the P-secretase cleavage site which have been operably linked 
to an N-terminal signal peptide for membrane insertion and secretion, and to C-terminal 
fragments of APP which have been operably linked to an N-terminal signal peptide for 
membrane insertion and secretion and a reporter sequence including but not limited to 
green fluorescent protein or alkaline phosphatase, such that p-secretasc cleavage releases 
the reporter protein from the surface of cells expressing the polypeptide, 

Having generally described the invention, the same will be more readily 

understood by reference to the following examples, which are provided by way of illustration 

and are not intended as limiting. 
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EXAMPLES 

Example I: Development of a Search Algorithm Useful for the Identification of 
Aspartyl Proteases, and Identification ofC. elegans Aspartyl Protease 
. Genes in Wormpep 12: 



,5 10 



Materials and Methods: 

Classical aspartyl proteases such as pepsin and renin possess a two-domain structure 
which folds to hring two aspartyl residues into proximity within the active site. These are 
embedded in the short tripeptide motif DTG, or more rarely, DSG. The DTG or DSG 
active site motif appears at about residue 25-30 in the enzyme, but at about 65-70 in the 
proenzyme (prorenin, pepsinogen). This motif appears again about 150-200 residues 
downstream. The proenzyme is activated by cleavage of the N-tcrminal prodomain. This 
20 pattern exemplifies the double domain structure of the modern day aspartyl enzymes which 

15 apparently arose by gene duplication and divergence. Thus; 

NH 2 X— »D 25 TG -Y -D Y+25 TG C 

?5 where X denotes the beginning of the enzyme, following the N-termtnal prodomain, and Y 

denotes the center of the molecule where the gene repeat begins again. 

In the case of the retroviral enzymes such as the HIV protease, they represent only a 
20 half of the two-domain structures of well-known enzymes like pepsin, cathepsin D, renin, 
30 etc. They have no prosegment, but are carved out of a polyprotcin precursor containing the 

gag and pol proteins of the virus. They can be represented by: 

NH2 D^TG- — -C100 

This "monomer" only has about 100 aa, so is extremely parsimonious as compared to the 
25 other aspartyl protease "dimers" which have of the order of 330 or so aa f not counting the 
N-terminal prodomain. 

The limited length of the eukaryotic aspartyl protease active site motif makes it 
40 difficult to search EST collections for novel sequences. EST sequences typically average 

250 nucleotides, and so in this case would be unlikely to span both aspartyl protease active 
30 site motifs. Instead, we turned to the C. elegans genome. The C. elegans genome is 

estimated to contain around 13,000 genes. Of these, roughly 12,000 have been sequenced 
and the corresponding hypothetical open reading frame (ORF) has been placed in the 
database Wormpep 12. We used this database as the basis for a whole genome scan of a 
higher eukaryote for novel aspartyl proteases, using an algorithm that we developed 
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specifically for this purpose. The following AWK script for locating proteins containing two 
5 DTG or DSG motifs was used for the search, which was repeated four times to recover all 

pairwise combinations of the aspartyl motif. 

BEGINtRS^V) f* defines as record separator for FASTA format *7 

5 { 

w pos = index($0,"DTG") /*finds "DTG" in record*/ 

if (pos>0) ( 

rest = substr($0,pos+3) /*get rest of record after first DTG*/ 
pos2 = index(resL, M DTG") /*find second DTG*/ 
10 if (pos2>0) printf ("%s%s\nT V\$0) } /^report hits*/ 

15 ) 
) 

The AWK script shown above was used to search Wormpepl2, which was 
downloaded from ftp.sanger.ac.uk/pub/databases/wormpep, for sequence entries containing 
20 15 at least two DTG or DSG motifs. Using AWK limited each record to 3000 characters or 

less. Thus, 35 or so larger records were eliminated manually from 
Wormpep 1 2 as in any case these were unlikely to encode aspartyl proteases. 

25 

Results and Discussion: 
20 The Wormpep 12 database contains 12,178 entries, although some of these (<10%) 

represent alternatively spliced transcripts from the same gene. Estimates of the number of 
genes encoded in the C. elegans genome is on the order of 13,000 genes, so Wormpep 12 
may be estimated to cover greater than 90% of the C. elegans genome. 

Eukaryotic aspartyl proteases contain a two-domain structure, probably arising from 
25 ancestral gene duplication. Each domain contains the active site motif D(S/T)G located 
from 20-25 amino acid residues into each domain. The retroviral (e.g., HIV protease) or 
retrotransposon proteases are homodimers of subunits which are homologous to a single 
eukaryotic aspartyl protease domain. An AWK script was used to search the Wormpep 12 
40 database for proteins in which the D(S/T)G motif occurred at least twice. This identified 

30 >60 proteins with two DTG or DSG motifs. Visual inspection was used to select proteins 
in which the position of the aspartyl domains was suggestive of a two-domain structure 
^ meeting the criteria described above. 

In addition, the PROSITE eukaryotic arid viral aspartyl protease active site pattern 
PS00141 was used to search Wormpep 12 for candidate aspartyl proteases. (Bairoch A., 
35 Bucher P., Hofmann K M The PROSITE database: its status in 1997, Nucleic Acids Res. 
50 24:217-221(1997)). This generated an overlapping set of Wormpept2 sequences. Of these, 
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seven sequences contained two DTG or DSG motifs and the PROSITE aspartyl protease 
active site pattern. Of these seven, three were found in the same cosmid clone (F21F8.3, 
F21F8.4, and F2IF8.7) suggesting that they represent a family of proteins that arose by 
ancestral gene duplication. Two other ORFs with extensive homology to F21F8.3, F21F8.4 
5 and F21F8.7 are present in the same gene cluster (F21F8.2 and F21F8.6), however, these 
contain only a single DTG motif. Exhaustive BLAST searches with these seven sequences 
against Wormpepl2 failed to reveal additional candidate aspartyl proteases in the C. 
elegans genome containing two repeats of the DTG or DSG motif. 

BLASTX search with each C. elegans sequence against SW1SS-PROT, GenPep and 
10 TREMBL revealed that R12H7.2 was the closest worm homologue to the known 

mammalian aspartyl proteases, and that T18H9.2 was somewhat more distantly related, 
while CEASP1, F21F8.3, F21F8.4, and F21F8.7 formed a subcluster which had the least 
sequence homology to the mammalian sequences. 
Discussion: 

15 APP, the presenilis, and p35, the activator of cdk5, all undergo intracellular 

proteolytic processing at sites which conform to the substrate specificity of the HIV 
protease. Dysregulation of a cellular aspartyl protease with the same substrate specificity, 
might therefore provide a unifying mechanism for causation of the plaque and tangle 
pathologies in AD. Therefore, we sought to identify novel human aspartyl proteases. A 

20 whole genome scan in C. elegans identified seven open reading frames that adhere to the 
aspartyl protease profile that we had identified. These seven aspartyl proteases probably 
comprise the complete complement of such proteases in a simple, multicellular eukaryotc. 
These include four closely related aspartyl proteases unique to C elegans which probably 
arose by duplication of an ancestral gene. The other three candidate aspartyl proteases 

25 (T18H9.2, R12H7.2 and CI 1D2.2) were found to have homology to mammalian gene 
sequences. 
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Example 2: Identification of Novel Human Aspartyl Proteases Using Database 
5 Mining by Genome Bridging 

Materials and Methods; 

5 Computer-assisted analysis of EST databases, cDNA , and predicted polypeptide 
^ sequences: 

Exhaustive homology searches of EST databases with the CEASP1, F21F8.3, 
F21F8.4, and F21F87 sequences failed to reveal any novel mammalian homologues. 
TBLASTN searches with R 121 17.2 showed homology to cathepsin D, cathepsin E, 
15 io pepsinogen A, pepsinogen C and renin, particularly around the DTG motif within the active 

site, but also failed to identify any additional novel mammalian aspartyl proteases. This 
indicates that the C elegans genome probably contains only a single lysosomal aspartyl 
2Q protease which in mammals is represented by a gene family that arose through duplication 

and consequent modification of an ancestral gene. 
L5 TBLASTN searches with T18H9.2, the remaining C. elegans sequence, identified 

several ESTs which assembled into a contig encoding a novel human aspartyl protease (Hu- 
25 ASP1). As is described above in Example 1, BLASTX search with the Hu-ASPl contig 

against SWISS-PROT revealed that the active site motifs in the sequence aligned with the 
active sites of other aspartyl proteases. Exhaustive, repetitive rounds of BLASTN searches 
20 against LifeSeq, LifeSeqFL, and the public EST collections identified 102 EST from 
multiple cDN A libraries that assembled into a single contig. The 5 1 sequences in this 
contig found in public EST collections also have been assembled into a single contig 
(THC213329) by The Institute for Genome Research (TIGR). The TIGR annotation 
35 indicates that they failed to find any hits in the database for the contig. Note that the TIGR 

25 contig is the reverse complement of the LifeSeq contig that we assembled. BLASTN search 
of Hu-ASPl against the rat and mouse EST sequences in ZooSeq revealed one homologous 
EST in each database (Incyte clone 70031 1523 and IMAGE clone 313341, GenBank 

40 J 

accession number W10530, respectively). 

TBLASTN searches with the assembled DNA sequence for Hu-ASPl against both 
30 LifeSeqFL and the public EST databases identified a second, related human sequence (Hu- 
45 Asp2) represented by a single EST (2696295). Translation of this partial cDNA sequence 

reveals a single DTG motif which has homology to the active site motif of a bovine aspartyl 
protease, NM 1 . 
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BLAST searches, contig assemblies and multiple sequence alignments were 
performed using the bioinformatics tools provided with the LifeSeq, LifeSeqFL and 
LifeSeq Assembled databases from Incyte. Predicted protein motifs were identified using 
either the ProSite dictionary (Motifs in GCG 9) or the Pfam database. 

5 Full-length cDNA cloning of Hu-Aspl 

The open reading frame of C. elegans gene T18H9.2CE was used to query Incyte 
LifeSeq and LifeSeq-FL databases and a single electronic assembly referred to as 
1 863920CE1 was detected. The 5* most cDNA clone in this contig, 1863920, was obtained 
from Incyte and completely sequenced on both strands. Translation of the open reading 

10 frame contained within clone 1863920 revealed the presence of the duplicated aspartyl 

protease active site motif (DTG/DSG) but the 5' end was incomplete. The remainder of the 
Hu-Aspl coding sequence was determined by 5' Marathon RACE analysis using a human 
placenta Marathon ready cDNA template (Clonelech). A 3'-antisense oligonucleotide 
primer specific for the 5* end of clone 1863920 was paired with the 5'-sense primer specific 

15 for the Marathon ready cDNA synthetic adaptor in the PCR. Specific PCR products were 
directly sequenced by cycle sequencing and the resulting sequence assembled with the 
sequence of clone 1863920 to yield the complete coding sequence of Hu-Asp- 1 (SEQ ID 
No. 1). 

Several interesting features are present in the primary amino acid sequence 
20 of Hu-Aspl (Figure 1, SEQ ID No. 2). The sequence contains a signal peptide (residues 1- 
20 in SEQ ID No. 2), a pro-segment, and a catalytic domain containing two copies of the 
aspartyl protease active site motif (DTG/DSG). The spacing between the first and second 
active site motifs is about 200 residues which should correspond to the expected size of a 
single, eukaryotic aspartyl protease domain. More interestingly, the sequence contains a 
25 predicted transmembrane domain (residues 469-492 in SEQ ID No.2) near its C-terminus 
which suggests that the protease is anchored in the membrane. This feature is not found in 
any other aspartyl protease. 
Cloning of a full-length Hu-Asp-2 cDNAs: 

As is described above in Example 1, genome wide scan of the Caenorhabditis 
30 elegans database WormPepl2 for putative aspartyl proteases and subsequent mining of 
human EST databases revealed a human ortholog to the C elegans gene T18H9.2 referred 
to as Hu-Asp 1 . The assembled contig for Hu-Asp 1 was used to query for human paralogs 
using the BLAST search tool in human EST databases and a single significant match 
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(2696295CE1) with approximately 60% shared identity was found in the LifeSeq FL 
database. Similar queries of cither gblOSPubEST or the family of human databases 
available from TIGR did not identify similar EST clones. cDNA clone 2696295, identified 
by single pass sequence analysis from a human uterus cDNA library, was obtained from 
5 Incyte and completely sequence on both strands. This clone contained an incomplete 1266 
bp open-reading frame that encoded a 422 amino acid polypeptide but lacked an initiator 
ATG on the 5' end. Inspection of the predicted sequence revealed the presence of the 
duplicated aspartyl protease active site motif DTG/DSG, separated by 194 amino acid 
residues. Subsequent queries of later releases of the LifeSeq EST database identified an 

10 additional ESTs, sequenced from a human astrocyte cDNA library (4386993), that appeared 
to contain additional 5' sequence relative to clone 2696295. Clone 4386993 was obtained 
from Incyte and completely sequenced on both strands. Comparative analysis of clone 
4386993 and clone 2696295 confirmed that clone 4386993 extended the open-reading 
frame by 31 amino acid residues including two in-frame translation initiation codons. 

15 Despite the presence of the two in- frame ATGs, no in-frame stop codon was observed 
upstream of the ATG indicating that the 4386993 may not be full-length. Furthermore, 
alignment of the sequences of clones 2696295 and 4386993 revealed a 75 base pair 
insertion in clone 2696295 relative to clone 4386993 that results in the insertion of 25 
additional amino acid residues in 2696295. The remainder of the Hu-Asp2 coding sequence 

20 was determined by 5' Marathon RACE analysis using a human hippocampus Marathon 
ready cDNA template (Clonetech). A 3'-antisense oligonucleotide primer specific for the 
shared 5'-region of clones 2696295 and 4386993 was paired with the 5'-sense primer 
specific for the Marathon ready cDNA synthetic adaptor in the PCR. Specific PCR 
products were directly sequenced by cycle sequencing and the resulting sequence assembled 

25 with the sequence of clones 2696295 and 4386993 to yield the complete coding sequence 
of Hu-Asp2(a) (SEQ ID No. 3) and Hu-Asp2(b) (SEQ ID No. 5), respectively. 

Several interesting features are present in the primary amino acid sequence of Hu- 
Asp2(a) (Figure 2 and SEQ ID No. 4) and Hu-Asp-2(b) (Figure 3, SEQ ID No. 6). Both 
sequences contain a signal peptide (residues 1-21 in SEQ ID No. 4 and SEQ ID No. 6), a 

30 pro-segment, and a catalytic domain containing two copies of the aspartyl protease active 
site motif (DTG/DSG). The spacing between the first and second active site motifs is 
variable due to the 25 amino acid residue deletion in Hu-Asp-2(b) and consists of 168- 
versus-\9A amino acid residues, for Hu-Asp2(b) and Hu-Asp-2(ah respectively. More 
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interestingly, both sequences contains a predicted transmembrane domain (residues 455-477 

in SEQ ID No.4 and 430-452 in SEQ ED No. 6) near their C-termini which indicates that the 

protease is anchored in the membrane. This feature is not found in any other aspartyl 

protease except Hu-Aspl. 

Example 3. Molecular cloning of mouse Asp2 cDNA and genomic DNA. 
Cloning and characterization of murine Asp2 cDNA— The murine ortholog of Hu_Asp2 



was cloned using a combination of cDNA library screening, PCR, and genomic cloning. 
Approximately 500,000 independent clones from a mouse brain cDNA library were 
screened using a 32 P-labeled coding sequence probe prepared from Hu_Asp2. Replicate 
10 positives were subjected to DNA sequence analysis and the longest cDNA contained the 
20 entire 3' untranslated region and 47 amino acids in the coding region. PCR amplification 

of the same mouse brain cDNA library with an antisense oligonucleotide primer specific for 
the 5 '-most cDNA sequence determined above and a sense primer specific for the 5' region 
of human Asp2 sequence followed by DNA sequence analysis gave an additional 980 bp of 
15 the coding sequence. The remainder of the 5' sequence of murine Asp-2 was derived from 
genomic sequence (see below). 

Isolation and sequence analysis of the murine Asp 2 gene — A murine EST sequence 
encoding a portion of the murine Asp2 cDNA was identified in the GenBank EST database 
using the BLAST search tool and the Hu-Asp2 coding sequence as the query. Clone 
20 g3 160898 displayed 88% shared identity to the human sequence over 352 bp. 

Oligonucleotide primer pairs specific for this region of murine Asp2 were then synthesized 
and used to amplify regions of the murine gene. Murine genomic DNA, derived from strain 
129/SvJ, was amplified in the PCR (25 cycles) using various primer sets specific for murine 
45 Asp2 and the products analyzed by agarose gel electrophoresis. The primer set Zoo- 1 and 

25 Zoo-4 amplified a 750 bp fragment that contained approximately 600 bp of intron sequence 
based on comparison to the known cDNA sequence. This primer set was then used to 
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screen a murine BAC library by PCR, a single genomic clone was isolated and this cloned 

was confirmed contain the murine Asp2 gene by DNA sequence analysis. Shotgun DNA 

sequencing of this Asp2 genomic clone and comparison to the cDNA sequences of both 

Hu_Asp2 and the partial murine cDNA sequences defined the full-length sequence of 

5 murine Asp2 (SEQ ED No. 7). The predicted amino acid sequence of murine Asp2 (SEQ ID 

No. 8) showed 96.4% shared identity (CCG BestFit algorithm) with 18/501 amino acid 

residue substitutions compared to the human sequence (Figure 4). 

Example 4: Tissue Distribution of Expression of Hu- Asp 2 Transcripts: 
Materials and Methods: 

20 io The tissue distribution of expression of Hu-Asp-2 was determined using multiple 

tissue Northern blots obtained from Clonetech (Palo Alto, CA). Incyte clone 2696295 in 
the vector pINCY was digested to completion with EcoRVNotl and the 1 .8 kb cDNA insert 
purified by preparative agarose gel electrophoresis. This fragment was radiolabeled to a 
specific activity > 1 X 10 9 dpm/u,g by random priming in the presence of [ct- 32 P-dATP] 
15 (>3000 Ci/mmol, Amersham, Arlington Heights, IL) and Klenow fragment of DNA 

polymerase L Nylon filters containing denatured, size fractionated poly A + RNAs isolated 
30 from different human tissues were hybridized with 2 x IO 6 dpm/ml probe in ExpressHyb 

buffer (Clonetech, Palo Alto, CA) for 1 hour at 68 °C and washed as recommended by the 
manufacture. Hybridization signals were visualized by autoradiography using BioMax XR 
20 film (Kodak, Rochester, NY) with intensifying screens at -80 °C. 

35 

Results and Discussion: 

Limited information on the tissue distribution of expression of Hu-Asp-2 transcripts 
was obtained from database analysis due to the relatively small number of ESTs detected 
40 using the methods described above (< 5). In an effort to gain further information on the 

25 expression of the Hu-Asp2 gene, Northern analysis was employed to determine both the 
size(s) and abundance of Hu-Asp2 transcripts. PolyA + RNAs isolated from a series of 
peripheral tissues and brain regions were displayed on a solid support following separation 
under denaturing conditions and Hu-Asp2 transcripts were visualized by high stringency 
hybridization to radiolabeled insert from clone 2696295. The 2696295 cDNA probe 
30 visualized a constellation of transcripts that migrated with apparent sizes of 3.0kb, 4.4 k:b 
50 and 8.0 kb with the latter two transcript being the most abundant. 



45 



37 



55 



10 



25 



WO 00/1 7369 PCT/US99/2088 1 

Across Che tissues surveyed, Hu-Asp2 transcripts were most abundant in pancreas 

and brain with lower but detectable levels observed in all other tissues examined except 

thymus and PBLs. Given the relative abundance of Hu-Asp2 transcripts in brain, the 

regional expression in brain regions was also established. A similar constellation of 

5 transcript sizes were detected in all brain regions examined [cerebellum, cerebral cortex, 

occipital pole, frontal lobe, temporal lobe and putamen] with the highest abundance in the 

medulla and spinal cord. 

Example 5: Northern Blot Detection of HuAsp-l and HuAsp-2 Transcripts in 
15 Human Cell Lines: 

10 A variety of human cell lines were tested for their ability to produce Hu-Aspl and 

Asp2 mRNA. Human embryonic kidney (HEK-293) cells, African green monkey (Cos-7) 
cells, Chinese hamster ovary (CHO) cells, HELA cells, and the neuroblastoma cell line 
20 EvtR-32 were ail obtained from the ATCC. Cells were cultured in DME containing 10% 

FCS except CHO cells which were maintained in ot-MEM/10% PCS at 37 °C in 5% C0 2 
15 until they were near confluence. Washed monolayers of cells (3 X 10 7 ) were lysed on the 
dishes and poly A 4 " RNA extracted using the Qiagen Oligotex Direct mRNA kit. Samples 
containing 2 p.g of poly A + RNA from each cell line were fractionated under denaturing 
conditions (glyoxal-treated), transferred to a solid nylon membrane support by capillary 
action, and transcripts visualized by hybridization with random-primed labeled ( 32 P) coding 
30 20 sequence probes derived from either Hu-Aspl or Hu-Asp2. Radioactive signals were 

detected by exposure to X-ray film and by image analysis with a Phosphorlmager. 

The Hu-Aspl cDNA probe visualized a similar constellation of transcripts (2.6 kb 
and 3.5 kb) that were previously detected is human tissues. The relative abundance 
determined by quantification of the radioactive signal was Cos-7 > HEK 292 = HELA > 
25 IMR32. 

The Hu-Asp2 cDNA probe also visualized a similar constellation of transcripts 

40 compared to tissue (3.0 kb, 4.4 kb, and 8.0 kb) with the following relative abundance; HEK 

293 > Cos 7 > IMR32 > HELA. 

Example 6: Modification of APP to increase A0 processing for in vitro 
30 screening 

45 Human cell lines that process Ap peptide from APP provide a means to screen in 

cellular assays for inhibitors of P~ and y-sccrctasc. Production and release of AP peptide 
into the culture supernatant is monitored by an enzyme-linked immunosorbent assay (ElA). 
Although expression of APP is widespread and both neural and non-neuronal cell lines 
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process and release Ap peptide, levels of endogenous APP processing are low and difficult 
to detect by EIA. Ap processing can be increased by expressing in transformed cell lines 
mutations of APP that enhance Ap processing. We made the serendipitous observation that 
addition of two lysine residues Lo the carboxyl terminus of APP695 increases Ap processing 
10 5 still further. This allowed us to create a transformed cell line that releases Ap peptide into 

the culture medium at the remarkable level of 20,000 pg/ml. 
Materials And Methods 
Materials: 

Human embryonic kidney cell line 293 (HEK293 cells) were obtained internally. 

10 The vector pIRES-EGFP was purchased from Clontech. Oligonucleotides for mutation 

using the polymerase chain reaction (PCR) were purchased from Genosys. A plasmid 

20 containing human APP695 (SEQ ID No. 9 [nucleotide] and SEQ ID No. 10 [amino acid]) 

was obtained from Northwestern University Medical School. This was subcloned into pSK 

(Strutagene) at the Not\ site creating the plasmid pAPP695. 

15 Mutagenesis protocol: 
25 The Swedish mutation (K670N, M671L) was introduced into pAPP695 using the 

Stratagene Quick Change Mutagenesis Kit to create the plasmid p APP695NL (SEQ ID No. 

1 1 [nucleotide! and SEQ ED No. 12 [amino acid]). To introduce a di-lysine motif at the C- 

terminus of APP695, the forward primer #276 5' GACTGACCACTCGACCAGGTTC 

30 

20 (SEQ ID No. 47) was used with the "patch" primer #274 5* 

CGAATTAAATTCCAGCACACrGGCTACTTCTTGTTCTGCATCTCAAAGAAC (SEQ 
ID No. 48) and the flanking primer #275 CGAATTAAATTCCAGCACACTGGCTA (SEQ 
35 jo N 0t 49) t0 modify the 3" end of the APP695 cDNA (SEQ ID No. 15 [nucleotide] and 

SEQ ID No. 16 [amino acid]). This also added a BstXl restriction site that will be 
25 compatible with the BstXl site in the multiple cloning site of pIRES-EGFP. PCR 
^ amplification was performed with a Clontech HF Advantage cDNA PCR kit using the 

polymerase mix and buffers supplied by the manufacturer. For "patch" PCR, the patch 
primer was used at 1 /20th the molar concentration of the flanking primers. PCR 
amplification products were purified using a QIAquick PCR purification kit (Qiagen). 
45 30 After digestion with restriction enzymes, products were separated on 0.8% agarose gels and 

then excised DNA fragments were purified using a QIAquick gel extraction kit (Qiagen). 

To reassemble a modified APP695-Sw cDNA, the 5' Notl-Bgl2 fragment of the 
APP695-Sw cDNA and the V Bgl2-BstXl APP695 cDNA fragment obtained by PCR were 
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ligatcd into pIRES-EGFP plasrrud DNA opened at the Notl and BstXl sites. Ligations 
5 were performed for 5 minutes at room temperature using a Rapid DNA Ligation kit 

(Boehringer Mannheim) and transformed into Library Efficiency DH5a Competent Cells 
(GibcoBRL-Life Technologies). Bacterial colonies* were screened for inserts by PCR 
10 5 amplification using primers #276 and #275. Plasmid DNA was purified for mammalian 

cell transfection using a QIAprep Spin Miniprcp kit (Qiagen). The construct obtained was 
designated pMG125.3 (APPSW-KK. SEQ ID No. 17 [nucleotide] and SEQ ID No. 18 
[amino acid]). 

15 

Mammalian Cell Transfection: 
10 HEK293 cells for transfection were grown to 80% confluence in Dulbecco's 

modified Eagle's medium (DMEM) with 10% fetal bovine serum. Cotransfections were 
20 performed using LipofectAmine (Gibco-BRL) with 3 ug pMG 125.3 DNA and 9 u.g 

pcDNA3.1 DNA per 10 x 10 6 cells. Three days posttransfection, cells were passaged into 
medium containing G4 1 8 at a concentration of 400 u.g/ml. After three days growth in 
15 selective medium, cells were sorted by their fluorescence. 
25 Clonal Selection of 125 J cells by FACS: 

Cell samples were analyzed on an EPICS Elite ESP flow cytometer (Coulter, 
Hialcah, FL) equipped with a 488 nm excitation line supplied by an air-cooled argon laser. 
30 EGFP emission was measured through a 525 nm band-pass filter and fluorescence intensity 

20 was displayed on a 4-decade log scale after gating on viable cells as determined by forward 
and right angle light scatter. Single green cells were separated into each well of one 96 well 
plate containing growth medium without G41 8. After a four day recovery period, G41 8 
35 was added to the medium to a final concentration of 400 jig/ml. After selection, 32% of the 

wells contained expanding clones. Wells with clones were expanded from the 96 well plate 
25 to a 24 well plate and then a 6 well plate with the fastest growing colonies chosen for 
40 expansion at each passage. The final cell line selected was the fastest growing of the final 

six passaged. This clone, designated 125.3, has been maintained in G418 at 400 ug/ml with 
passage every four days into fresh medium. No loss of A(3 production of EGFP 
fluorescence has been seen over 23 passages. 

45 

30 A/3 EI A Analysis (Double Antibody Sandwich ELISA for HA ft 1-40/42): 

Cell culture supematants harvested 48 hr after transfection were analyzed in a 
standard A$ EIA as follows. Human Ap 1-40 or 1-42 was measured using monoclonal 
50 antibody (mAb) 6E10 (Senetek, St. Louis, MO) and biotinylated rabbit antiserum 162 or 
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164 (New York State Institute for Basic Research, Staten Island, NY) in a double antibody 
5 sandwich ELISA. The capture antibody 6E10 is specific to an epitope present on the N- 

terminal amino acid residues 1-16 of hAp. The conjugated detecting antibodies 162 and 
164 are specific for hAp 1-40 and 1-42, respectively. Briefly, a Nunc Maxisorp 96 well 
W 5 immunoplate was coated with 100 U-l/well of mAb 6E10 (5u.g/ml) diluted in 0.1M 

carbonate-bicarbonate buffer, pH 9.6 and incubated at 4°C overnight. After washing the 
plate 3x with 0.01M DPBS (Modified Dulbecco's Phosphate Buffered Saline (0.008M 
sodium phosphate, 0.O02M potassium phosphate, 0. 14M sodium chloride, 0.01 M 
potassium chloride, pH 7.4) from Pierce, Rockford, II) containing 0.05% of Tween-20 
10 (DPBST), the plate was blocked for 60 min with 200[il of 10% normal sheep serum 

(Sigma) in 0.0 1M DPBS to avoid non-specific binding. Human A£ 1-40 or 1-42 standards 
20 100u.l/well (Bachem, Torrance, CA) diluted, from a lmg/ml stock solution in DMSO, in 

culture medium was added after washing the plate, as well as 100pJ/well of sample, 
e.g.conditioned medium of transfected cells. The plate was incubated for 2 hours at room 
15 temperature and 4°C overnight. The next day, after washing the plate, lOOu-lAvell 

biotinylated rabbit antiserum 162 1:400 or 164 1:50 diluted in DPBST + 0.5% BSA was 
added and incubated at room temperature for Ihr 15 min. Following washes, lOOud/well 
neutravidin-horseradish peroxidase (Pierce, Rockford, II) diluted 1:10,000 in DPBST was 
applied and incubated for 1 hr at room temperature. After the last washes lOOjilAvcll of o- 
20 phenylnediamine dihydrochloride (Sigma Chemicals, St. Louis, MO) in 50mM citric 
acid/lOOmM sodium phosphate buffer (Sigma Chemicals, St. Louis, MO), pH 5.0, was 
added as substrate and the color development was monitored at 450nm in a kinetic 
microplate reader for 20 min. using Soft max Pro software. All standards and samples were 
run in triplicates. The samples with absorbance values falling within the standard curve 
25 were extrapolated from the standard curves using Soft max Pro software and expressed in 
pg/ml culture medium. 
Results: 

Addition of two lysine residues to the carboxyl terminus of APP695 greatly 
increases A(i processing in HEK293 cells as shown by transient expression (Table 1 ). 
30 Addition of the di-lysine motif to APP695 increases Ap processing to that seen with the 

APP695 containing the Swedish mutation. Combining the di-lysine motif with the Swedish 
mutation further increases processing by an additional 2.8 fold. 
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Cotrans formation of HEK293 cells with pMG!25.3 and pcDNA3.1 allowed dual 

selection of transformed cells for G418 resistance and high level expression of EGFP. 

After clonal selection by FACS, the cell line obtained, produces a remarkable 20,000 pg A (3 

peptide per ml of culture medium after growth for 36 hr in 24 well plates. Production of 

5 A(5 peptide under various growth conditions is summarized in Table 2. 

TABLE 1 . Release of A0 peptide into the culture medium 48 hr after transient 
transfection of HEK293 cells with the indicated vectors containing wildtypc or modified 
APP. Values tabulated are mean + SD and P-value for pairwise comparison using Student's 
t-test assuming unequal variances. 



APP Construct 


AP 1-40 peptide 
(pg/ml) 


Fold Increase 


P-value 


pIRES-EGFP vector 


147 + 28 


1.0 




wt APP695 (142.3) 


194+ 15 


1.3 


0.051 


wt APP695-KK (124.1) 


424 + 34 


2.8 


3 x 10-5 


APP695-Sw (143.3) 


457 + 65 


3.1 


2 x 10-3 


APP695-SwKK (125.3) 


1308 + 98 


8.9 


3 x 10-4 



TABLE 2. Release of AP peptide from IIEK125.3 cells under various growth 
25 conditions. 



Type of Culture 


Volume of 


Duration of 


Ab 1-40 


Ab 1-42 


Plate 


Medium 


Culture 


(pg/ml) 


(pg/ml) 


24 well plate 


400 ul 


36 hi 


28,036 


1,439 



15 

Example 7: Antisense oligomer inhibition of Abeta processing in HEK125.3 cells 
The sequences of Hu-Aspl and Hu-Asp2 were provided to Sequitur, Inc (Natick, 

MA) for selection of targeted sequences and design of 2nd generation chimeric antisense 
35 oligomers using prorietary technology (Sequitur Ver. D Pat pending #3002). Antisense 

20 oligomers Lot# S644, S645, S646 and S647 were targeted against Aspl. Antisense 

oligomers Lot# S648, S649, S650 and S651 were targeted against Asp2. Control antisense 
40 oligomers Lot# S652, S653, S655, and S674 were targeted against an irrelevant gene and 

antisense oligomers Lot #S656, S657, S65S, and S659 were targeted against a second 

irrelevant gene. 

25 For transfection with the antisense oligomers, HEK125.3 cells were grown to about 

45 50% confluence in 6 well plates in Minimal Essential Medium (MEM) supplemented with 

10% fetal calf serum. A stock solution of oligofectin G (Sequitur Inc., Natick, MA) at 2 
mg/ml was diluted to 50 U-g/ml in serum free MEM. Separately, the antisense oligomer 
50 stock solution at 100 jiM was diluted to 800 nM in Opti-MEM (GffiCO-BRL, Grand 
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Island, NY). The diluted stocks of oligofeain G and antisense oligomer were then mixed at 
5 a ratio of 1 : 1 and incubated at room temperature. After 1 5 min incubation, the reagent was 

diluted 10 fold into MEM containing 10% fetal calf serum and 2 ml was added to each well 
of the 6 well plate after first removing the old medium. After transfection, cells were 
10 5 grown in the continual presence of the oligofectin G/antisense oligomer. To monitor Ab 

peptide release, 400 uJ of conditioned medium was removed periodically from the culture 
well and replaced with fresh medium beginning 24 hr after transfection. Data reported are 
from culture supernatant harvested 48 hi after transfection. 

15 

Results: 

10 The 16 different antisense oligomers obtained from Sequitur Inc were transfected 

separately into HEK125.3 cells to determine their affect on A0 peptide processing. Only 
20 antisense oligomers targeted against Asp I & Asp2 reduced Abcta processing by I1EK125.3 

cells with those targeted against Asp2 having a greater inhibitory effect. Both A3 (1-40) 
and Afl (1-42) were inhibited by the same degree. In Table 3, percent inhibition is 

15 calculated with respect to untransfected cells. Antisense oligomer reagents giving greater 

25 

than 50% inhibition are marked with an asterisk. Of the reagents tested, 3 of 4 antisense 
oligomers targeted against ASP1 gave an average 52% inhibition of AP M0 processing 
and 47% inhibition of Ap 1-42 processing. For ASP2, 4 of 4 antisense oligomers gave 
30 greater than 50% inhibition with an average inhibition of 62% for Ap 1-40 processing and 

20 60% for Ap 1-42 processing. 

Table 3. Inhibition of AP peptide release from HEK125.3 cells treated with antisense 



35 



40 



45 



oligomers. 








Gene Targeted 


Antisense Oligomer 


Abcta (1-40) 


Abcta (L-42) 


Aspl-1 


S644 


62%* 


56%* 


Aspl-2 


S645 


41%* 


38%* 


Asp 1-3 


S646 


52%* 


46%* 


Asp 1-4 


S647 


6% 


25% 


Asp2-1 


S648 


71%* 


67%* 


Asp2-2 


S649 


83%* 


76%* 


Asp2-3 


S650 


46%* 


50%* 


Asp2-4 


S651 


47%* 


46%* 


Conl-1 


S652 


13% 


18% 


Con 1-2 


S653 


35% 


30% 


Con 1-3 


S655 


9% 


18% 


Con 1-4 


S674 


29% 


18% 


Con2-l 


S656 


12% 


18% 


Con2-2 


S657 


16% 


19% 


Con2-3 


S658 


8% 


35% 
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Example 8. Demonstration of Hu-Asp2 P- Secretase Activity in Cultured Cells 

5 

Several mutations in APP associated with early onset Alzheimer's disease have been 
shown to alter Ap peptide processing. These flank the N- and C-terminal cleavage sites 
that release AC from APP. These cleavage sites are referred to as the P-secretase and y- 

10 5 secretase cleavage sites, respectively. Cleavage of APP at the P-secretase site creates a C- 

terminal fragment of APP containing 99 amino acids of 1 1,145 daltons molecular weight. 
The Swedish KM->NL mutation immediately upstream of the P-secretase cleavage site 

15 causes a general increase in production of both the 1-40 and 1-42 amino acid forms of AC 

peptide. The London VF mutation (V717-»F in the APP770 isoform) has little effect on 
10 total AD peptide production, but appears to preferentially increase the percentage of the 
longer 1-42 amino acid form of AD peptide by affecting the choice of y-sccretase cleavage 

20 

site used during APP processing. Thus, we sought to determine if these mutations altered 
the amount and type of AD peptide produced by cultured cells cotransfected with a 
construct directing expression of IIu-Asp2. 
25 15 Two experiments were performed which demonstrate Hu-Asp2 P-secretase 

activity in cultured cells. In the first experiment, treatment of HEK 1 25.3 cells with antisense 

oligomers directed against Hu-Asp2 transcripts as described in Example 7 was found to 

30 

decrease the amount of the C-terminal fragment of APP created by p-secretase cleavage 
(CTF99) (Figure 9). This shows that Hu-Asp2 acts directly or indirecdy to facilitate P- 

35 20 secretase cleavage. In the second experiment, increased expression of Hu-Asp2 in 

transfected mouse Neuro2A cells is shown to increase accumulation of the CTF99 0- 
secretase cleavage fragment (Figure 10). This increase is seen most easily when a mutant 

40 APP-KK clone containing a C-terminal di-lysine motif is used for transfection. A further 

increase is seen when Hu-Asp2 is cotransfected with APP-Sw-KK containing the Swedish 
25 mutation KM ->NL. The Swedish mutation is known to increase cleavage of APP by the P~ 

45 

secretase. 
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A second set of experiments demonstrate Hu-Asp2 facilitates y-secretase activity in 
5 cotransfection experiments with human embryonic kidney HEK293 ceils. Cotransfection of 

Hu-Asp2 with an APP-KK clone greatly increases production and release of soluble AfJl-40 
and A01-42 peptides from IIEK293 cells. There is a proportionately greater increase in the 
10 5 release of AP 1-42. A further increase in production of Afil-42 is seen when Hu-Asp2 is 

cotransfected with APP-VF (SEQ ID No. 13 [nucleotide] and SEQ ID No. 14 [amino acid]) 
or APP-VF-KK SEQ ID No. 19 [nucleotide] and SEQ ID No. 20 [amino acid]) clones 
containing the London mutation V717— »F. The V717-*F mutation is known to alter 

15 

cleavage specificity of the APP y-secretase such that the preference for cleavage at the A(542 
10 site is increased. Thus, Asp2 acts directly or indirectly to facilitate y-secretase processing of 
APP at the 042 cleavage site. 

20 

Materials 

Antibodies 6E10 and 4G8 were purchased from Senetek (St. Louis, MO). Antibody 369 
was obtained from the laboratory of Paul Greengard at the Rockefeller University. 
15 Antibody C8 was obtained from the laboratory of Dennis Selkoe at the Harvard Medical 
School and Brigham and Women's Hospital. 
APP Constructs used 

30 The APP constructs used for transfection expenments comprised the following 

20 APP wild-type APP695 (SEQ ID No. 9 and No. 10) 

APP-Sw APP695 containing the Swedish KM->NL mutation (SEQ ID No. 1 1 

and No. 12), 

APP-VF APP695 containing the London V-*F mutation (SEQ ED No. 13 and 

No. 14) 

25 APP-KK APP695 containing a C-terminal KK motif (SEQ ID No. 15 and No. 

16), 

APP-Sw-KK APP695-Sw containing a C-terminal KK motif (SEQ ED No. 17 and 
No. 18). 

APP-VF-KK APP695-VF containing a C-terminal KK motif (SEQ ID No. 19 and 
30 No. 20). 

These were inserted into the vector pIRES-EGFP (Clontech. Palo Alto CA) between the 

Not\ andBstXl sites using appropriate linker sequences introduced by PCR. 

Transfection of antisense oligomers orplasmid DNA constructs in HEK293 cells, 
35 HEKI2S.3 cells and Neuro-2A cells, 
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Human embryonic kidney HEK293 cells and mouse Neuro-2a cells were transfected with 
5 expression constructs using the Lipofectamine Plus reagent from Gibco/BRI.. Cells were 

seeded in 24 well tissue culture plates to a density of 70-80% confluence. Four wells per 
plate were transfected with 2 fig DNA (3: 1, APPxotransfcctant), 8|xl Plus reagent, and 4(il 
10 5 Lipofectamine in OptiMEM. OptiMEM was added to a total volume of 1 ml, distributed 

200 |il per well and incubated 3 hours. Care was taken to hold constant the ratios of the two 
plasmids used for cotransfection as well as the total amount of DNA used in the 
transfection. The transfection media was replaced with DMEM, 10%FBS, NaPyruvate, 
with antibiotic/antimycotic and the cells were incubated under normal conditions (37°, 5% 
10 C0 2 ) for 48 hours. The conditioned media were removed to polypropylene tubes and 

stored at -80°C until assayed for the content of A(3 1 -40 and Ap 1 -42 by EIA as described in 
20 the preceding examples. Transfection of antisense oligomers into HEK125.3 cells was as 

described in Example 7. 



15 



25 



Preparation of cell extracts, Western blot protocol 

15 Cells were harvested after being transfected with plasmid DNA for about 60 hours. 

First, cells were transferred to 15-ml conical tube from the plate and centrifuged at 1,500 
rpm for 5 min to remove the medium. The cell pellets were washed with PBS for one time. 
30 We then lysed the cells with lysis buffer ( 10 mM HEPES, pH 7.9, 150 mM NaCl, 10% 

glycerol, 1 mM EGTA, 1 mM EDTA, 0.1 mM sodium vanadate and 1% NP-40). The lysed 
20 cell mixtures were centrifuged at 5000 rpm and the supernatant was stored at -20°C as the 
cell extracts. Equal amounts of extracts from HEK125.3 cells transfected with the Asp2 
35 antisense oligomers and controls were precipitated with antibody 369 that recognizes the C- 

terminus of APP and then CTF99 was detected in the immunoprecipitate with antibody 
6E10. The experiment was repeated using C8, a second precipitating antibody that also 
40 25 recognizes the C- terminus of APP. For Western blot of extracts from mouse Neuro-2a cells 

cotransfected with Hu-Asp2 and APP-KK, APP-Sw-KK, APP-VF-KK or APP-VF, equal 
amounts of cell extracts were electrophorcsed through 4-10% or 10-20% Tricine gradient 
gels (NOVEX, San Diego, CA). Full length APP and the CTF99 (i-secretase product were 
detected with antibody 6E10. 

30 Results 
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Transection of HEK125.3 cells with Asp2-t or Asp2-2 antisense oligomers reduces 
production of the CTF P-secretase product in comparison to cells similarly transfected with 
control oligomers having the reverse sequence (Asp2-I reverse & Asp2-2 reverse) 
In cotransfection experiments, cotransfection of Hu-Asp2 into mouse Neuro-2a cells with 
5 the APP-FCK construct increased the formation of CTF99. This was further increased if Hu- 
Asp2 was coexpressed with APP-Sw-KK, a mutant form of APP containing the Swedish 
KM-»NL mutation that increases p-secretase processing. 

Cotransfection of Hu-Asp2 with APP has little effect on A|}40 production but 
increases Ap42 production above background (Table 4). Addition of the di-lysine motif to 
10 the C-terminus of APP increases AP peptide processing about two fold, although AP40 and 
A|342 production remain quite low (352 pg/ml and 21 pg/ml, respectively). Cotransfection 
of Asp2 with APP-KK further increases both AP40 and AP42 production. The stimulation 
of A04O production by Hu-Asp2 is more that 3 fold, while production of Ap42 increases by 
more than 10 fold. Thus, cotransfection of Hu-Asp2 and APP-KK constructs preferentially 
15 increases Ap42 production. 

The APP V717->F mutation has been shown to increase y-secretase processing at the 
35 AP42 cleavage site. Cotransfection of Hu-Asp2 with the APP-VF or APP VF-KK 

constructs increased Ap42 production ( a two fold increase with APP-VF and a four-fold 
increase with APP-VF-KK, Table 4), but had mixed effects on AP40 production (a slight 

40 

20 decrease with APP-VF, and a two fold increase with APP-VF-KK in comparison to the 
pcDNA cotransfection control. Thus, the effect of Asp2 on AP42 production was 
45 proportionately greater leading to an increase in the ratio of Ap42/total Ap. Indeed, the ratio 

of Ap42/total AP reaches a very high value of 42% in HEK293 cells cotransfected with Hu- 
Asp2 and APP-VF-KK. 
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Western blot showing reduction of CTF99 production by HEK125.3 cells transfected with 
antisense oligomers targeting the Hu-Asp2 mRNA. (right) Western blot showing increase in 
CTF99 production in mouse Neuro-2a cells cotransfected with Hu-Asp2 and APP-KK. A 
further increase in CTF99 production is seen in cells cotransfected with Hu-Asp2 and APP- 
SwKK. 

Table 4. Results of cotransfecting Hu-Asp2 or pcDNA plasmid DNA with various APP 
constructs containing the V717-»F mutation that modifies Y-secretase processing. 
Cotransfection with Asp2 consistently increases the ratio of Ap42/total A0. Values 
tabulated are Ap peptide pg/ml. 







pcDNA 
Cotransfection 




Asp2 

Cotransfection 








A^40 


Ap42 


A[342/T«tal 


A04O 


AP42 


A342/Total 


APP 


192+18 


<4 


<2% 


188±40 


8+10 


3.9% 


APP-VF 


118+15 


15+19 


11.5% 


85+7 


24+12 


22.4% 


APP-KK 


352+24 


21+6 


5.5% 


1062+101 


226+49 


17.5% 


APP -VF-KK 


230+31 


88+24 


27.7% 


491+35 


355+36 


42% 



Example 9. Bacterial expression of human Asp2L 
Expression of recombinant Hu_Asp2E in E. coli. 

15 

Hu-Asp2L can be expressed in E. coli after addition of N-terminal sequences such as a T7 
tag (SEQ ID No. 21 and No. 22) or a T7 tag followed by a caspase 8 leader sequence (SEQ 
ID No. 23 and No. 24). Alternatively, reduction of the GC content of the 5* sequence by site 
directed mutagenesis can be used to increase the yield of Hu-Asp2 (SEQ ID No. 25 and No. 
20 26). In addition. Asp2 can be engineered with a proteolytic cleavage site (SEQ ID No. 27 
and No. 28). To produce a soluble protein after expression and refolding, deletion of the 
transmembrane domain and cytoplasmic tail, or deletion of the membrane proximal region, 
transmembrane domain, and cytoplasmic tail is preferred. 
Methods 
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10 



20 



PCR with primers containing appropriate linker sequences was used to assemble fusions of 
Asp2 coding sequence with N-terminal sequence modifications including a T7 tag (SEQ ID 
Nos. 2 1 and 22) or a T7-caspase 8 leader (SEQ ID Nos. 23 and 24). These constructs were 
5 cloned into the expression vector pet23a(+) [Novagen] in which a T7 promoter directs 
expression of a T7 tag preceding a sequence of multiple cloning sites. To clone Hu-Asp2 
sequences behind the T7 leader of pet23a+, the following oligonucleotides were used for 
amplification of the selected Hu-Asp2 sequence; 
15 #553=GTGGATCCACCCAGCACGGCATCCGGCTG (SEQ ID No. 35), 

10 #554=GAAAGCnTCATGACTCATCTGTCTGTGGAATGTTG (SEQ ID No. 36) which 
placed BamHI and Hindlil sites flanking the 5' and 3' ends of the insert, respectively. The 
Asp2 sequence was amplified from the full length Asp2(b) cDNA cloned into pcDNA3. 1 
using the Advantage-GC cDNA PCR [Clontech] following the manufacturer's supplied 
protocol using annealing & extension at 68 C C in a two-step PCR cycle for 25 cycles. The 
15 insert and vector were cut with BamHI and Hindffl, purified by electrophoresis through an 
25 agarose gel, then ligated using the Rapid DNA Ligation kit [Boerhinger Mannheim]. The 

ligation reaction was used to transform the E. coli strain JM109 (Promega) and colonies 
were picked for the purification of plasmid (Qiagen,Qiaprep minispin) and DNA sequence 
analysis . For inducible expression using induction with isopropyl b-D- 
30 20 thiogalactopyranoside (IPTG), the expression vector was transferred into E. coli strain 

BL21 (Statagene). Bacterial cultures were grown in LB broth in the presence of ampicillin 
at 100 ug/ml, and induced in log phase growth at an OD600 of 0.6-1.0 with 1 mM IPTG for 
35 4hourat37°C. The cell pellet was harvested by centrifugation. 

To clone Hu-Asp2 sequences behind the T7 tag and caspase leader (SEQ ID Nos. 23 
25 and 24), the construct created above containing the T7-Hu-Asp2 sequence (SEQ ID Nos. 2 1 
and 22) was opened at the BamH 1 site, and then the phosphorylated caspase 8 leader 

40 

oligonucleotides #559=GATCGATGACTATCTCTGACTCTCCGCGTGAACAGGACG 
(SEQ ID No. 37), #560=GATCCGTCCTGTTCACGCGGAGAGTCAGAGATAGTCATC 
(SEQ ID No. 38) were annealed and ligated to the vector DNA. The 5' overhang for each set 
45 30 of oligonucleotides was designed such that it allowed ligation into the BamHI site but not 

subsequent digestion with BamHI. The ligation reaction was transformed into JM109 as 
above for analysis of protein expression after transfer to E. coli strain BL21. 
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In order to reduce the GC content of the 5' terminus of asp2, a pair of antiparallel oligos 
were designed to change degenerate codon bases in 15 amino acid positions from G/C to 
A/T (SEQ ID Nos. 25 and 26). The new nucleotide sequence at the 5' end of asp2 did not 
change the encoded amino acid and was chosen to optimize E. Coli expression. The 
5 sequence of the sense linker is 5' 

CGGCATCCGGCTGCCCCTGCGTAGCGGTCTGGGTGGTGCTCCACTGGGTCTGCG 
TCTGCCCCGGGAGACCGACGAA G 3 '(SEQ ID No. 39). The sequence of the antisense 
linker is : 5' 

CTTCGTCGGTCTCCCGGGGCAGACGCAGACCCAGTGGAGCACCACCCAGACCG 
10 CTACGCAGGGGCAGCCGGATGCCG 3' (SEQ ID No. 40). After annealing the 

phosphorylated linkers together in 0. 1 M NaCl- 10 mM Tris, pH 7.4 they were ligated into 
unique Cla 1 and Sma I sites in Ilu-Asp2 in the vector pTAC. Por inducible expression 
using induction with isopropyl b-D-thiogalactopyranoside (IFTG), bacterial cultures were 
grown in LB broth in the presence of ampicillin at 100 ug/ml, and induced in log phase 
15 growth at an OD600 of 0.6- 1.0 with I mM IPTG for 4 hour at 37°C. The cell pellet was 
harvested by centrifugation. 

To create a vector in which the leader sequences can be removed by limited 
proteolysis with caspase 8 such that this liberates a Hu-Asp2 polypeptide beginning with 
the N-tcrminal sequence GSFV (SEQ ID Nos. 27 and 28), the following procedure was 
20 followed. Two phosphorylated oligonucleotides containing the caspase 8 cleavage site 
IETD, #571=5' 

GATCGATGACTATCTCTGACTCTCCGCTGGACTCTGGTATCGAAACCGACG 
(SEQ ID No. 41) and #572= 

GATCCGTCGGTTTCGATACCAGAGTCCAGCGGAGAGTCAGAGATAGTCATC 
25 (SEQ ID No. 42) were annealed and ligated into pET23a-i- that had been opened with 
BarnHI. After transformation into JM109, the purified vector DNA was recovered and 
orientation of the insert was confirmed by DNA sequence analysis. +, the following 
oligonucleotides were used for amplification of the selected Hu-Asp2 sequence: 
#573=5'AAGGATCCTTTGTGGAGATGGTGGACAACCTG, (SEQ ED No. 43) 
30 #554=GAAAGCTTTCATGACTCATCTGTCTGTGGAATGTTG (SEQ ID No. 44) which 
placed BarnHI and Hindlll sites flanking the 5' and 3' ends of the insert, respectively. The 
Asp2 sequence was amplified from the full length Asp2 cDNA cloned into pcDNA3. 1 
using the Advantage-GC cDNA PCR [Clontech] following the manufacturer's supplied 
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protocol using annealing & extension at 68°C in a two-step PCR cycle for 25 cycles. The 
insert and vector were cut with BamHI and Hindlll, purified by electrophoresis through an 
agarose gel, then iigated using the Rapid DNA Ligation kit [Boerhinger Mannheim]. The 
ligation reaction was used to transform the E. coli strain JM109 [Promega] and colonies 
5 were picked for the purification of plasmid (Qiagen,Qiaprep mimspin) and DNA sequence 
analysis . For inducible expression using induction with isopropyl b-D- 
thiogalactopyranoside (IPTG), the expression vector was transferred into E. coli strain 
BL21 (StaLagene). Bacterial cultures were grown in LB broth in the presence of ampicillin 
at 100 ug/ml. and induced in log phase growth at an OD600 of 0.6-1.0 with 1 mM IPTG for 
10 4 hour at 37°C. The cell pellet was harvested by centrifugation. 

To assist purification, a 6-His tag can be introduced into any of the above constructs 
following the T7 leader by opening the construct at the BamHI site and then ligating in the 
annealed, phosphorylated oligonucleotides containing the six histidine sequence 
#565=GATCGCATCATCACCATCACCATG (SEQ ID No. 45), 
15 #5 66=G ATCCATGGTGATGGTGATGATGC (SEQ ID No. 46). The 5' overhang for each 
set of oligonucleotides was designed such that it allowed ligation into the BatuHI site but 
not subsequent digestion with BamHI. 

Preparation of Bacterial Pellet: 

36.34g of bacterial pellet representing 10.8L of growth was dispersed into a total 
20 volume of 200ml using a 20mm tissue homogenizer probe at 3000 to 5000 rpm in 2M KC1, 
0.1M Tris, 0.05M EDTA, ImM DTT. The conductivity adjusted to about 193mMhos with 
water. 

35 After the pellet was dispersed, an additional amount of the KC1 solution was added, 

bringing the total volume to 500 ml. This suspension was homogenized further for about 3 

25 minutes at 5000 rpm using the same probe. The mixture was then passed through a Rannie 

high-pressure homogenizer at 10,000psi. 

In all cases, the pellet material was carried forward, while the soluble fraction was 
discaided.Tbe resultant solution wis centrifuged in a GSA rotor for lhr. at 12,500 rpm. The 
pellet was resuspended in the same solution (without the DTT) using the same tissue 
30 homogenizer probe at 2,000 rpm. After homogenizing for 5 minutes at 3000 rpm, the 
volume was adjusted to 500mJ with the same solution, and spun for lhr. at 12,500 rpm. 
50 The pellet was then resuspended as before, but this time the final volume was adjusted to 
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1.5L with the same solution prior to homogenizing for 5 minutes. After centrifuging at the 
same speed for 30 minutes, this procedure was repeated. The pellet was then resuspended 
into about 150ml of cold water, pooling the pellets from the six centrifuge tubes used in the 
GSA rotor. The pellet has homogenized for 5 minutes at 3,000 rpm, volume adjusted to 
5 250ml with cold water, then spun for 30 minutes. Weight of the resultant pellet was 
17.75g. 

Summary: Lysis of bacterial pellet in KCI solution, followed by centrifugation in a 
GSA rotor was used to initially prepare the pellet. The same solution was then used an 
additional three times for resuspension/homogenization. A final water 
K) wash/homogenization was then performed to remove excess KCI and EDTA. 
Soluhlization of rHuAsp2L: 

A ratio of 9-10ml/gram of pellet was utilized for solubilizing the rHuAsp2L from the pellet 
previously described. 17.75g of pellet was thawed, and 150ml of 8M guanidine HC1, 5mM 
(JME, 0.1% DEA, was added. 3M Tris was used to titrate the pH to 8.6. The pellet was 
15 initially resuspended into the guanidine solution using a 20mm tissue homogenizer probe at 
1000 rpm. The mixture was then stirred at 4°C for 1 hour prior to centrifugation at 
12,500rpm for 1 hour in GSA rotor. The resultant supernatant was then centrifuged for 
30min at 40,000 x g in an SS-34 rotor. The final supernatant was then stored at -20°C, 
30 except for 50ml. 

20 Immobilized Nickel Affinity Chromatography of Solubilized rHu Asp2L: 

The following solutions were utilized: 

A) 6M Guanidine HC1, 0. 1M NaP, pH 8.0, 0.01M Tris, 5mM 0ME, 0.5mM Imidazole 
A') 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl 
35 B') 6M Urea, 20mM NaP, pH 6.20, 50mM NaCl, 12mM Imidazole 

25 C) 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl, 300mM Imidazole 

Note: Buffers A' and C were mixed at the appropriate ratios to give intermediate 

concentrations of Imidazole. 

The 50ml of solubilized material was combined with 50ml of buffer A prior to adding to 
100-I25ml Qiagen Ni-NTA SupcrFlow (pre-equilibrated with buffer A) in a 5 x 10cm Bio- 
30 Rad econo column. This was shaken gently overnight at 4°C in the cold room. 
Chromatography Steps: 

1) Drained the resultant flow through. 

2) Washed with 50ml buffer A (collecting into flow through fraction) 

3) Washed with 250ml buffer A (wash I) 
35 4 > Washed with 250ml buffer A (wash 2) 

50 5) Washed with 250ml buffer A' 
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6) Washed with 250nil buffer B' 

7) Washed with 250mJ buffer A' 

8 ) Eluted with 250ml 75mM Imidazole 

9) Eluted with 250ml 150mM Imidazole 050-1) 

10) Elated with 250ml 150mM Imidazole 1 150-2 ) 

1 1 ) Eluted with 250ml 3O0mM Imidazole 1 300- 1 ) 

12) Eluted with 250ml 300mM Imidazole (300-2) 

13 ) Eluted with 250ml 300mM Imidazole * 300-3 ) 

Chromatography Results: 

The rHuAsp eluted at 75mM Imidazole through 300mM Imidazole. The 75mM fraction, as 
well as the first 150mM Imidazole (150-1) fraction contained contaminating proteins as 
visualized on Coomassie Blue stained gels. Therefore, fractions 150-2 and 300-1 will be 
utilized for refolding experiments since they contained the greatest amount of protein (see 
Coomassie Blue stained gel). 
Refolding Experiments of rHuAsp2L: 
Experiment I: 

Forty ml of 150-2 was spiked with 1M DTT, 3M Tris, pH 7.4 and DEA to a final 
concentration of 6mM, 50mM, and 0.1% respectively. This was diluted suddenly (while 
stirring) with 200ml of (4°C) cold 20mM NaP, pH 6.8, 150mM NaCl. This dilution gave a 
final Urea concentration of 1M. This solution remained clear, even if allowed to set open to 
the air at RT or at 4°C . 

After setting open to the air for 4-5 hours at 4°C, this solution was then dialyzed overnight 
against 20mM NaP, pH 7.4, 150mM NaCl, 20% glycerol. This method effectively removes 
the urea in the solution without precipitation of the protein. 
Experiment 2: 

Some of the 150-2 eluate was concentrated 2x on an Amicon Centriprep, 10,000 MWCO, 
then treated as in Experiment 1. This material also stayed in solution, with no visible 
precipitation. 
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Experiment 3: 

5 89ml of the 150-2 eluate was spiked with 1M DTT, 3M Tris, pH 7.4 and DEA to a final 

concentration of 6mM, 50mM, and 0.1% respectively. This was diluted suddenly (while 
stirring) with 445ml of (4°C) cold 20mM NaP, pH 6.8, 150mM NaCl. This solution 

10 5 appeared clear, with no apparent precipitation. The solution was removed to RT and stirred 

for 10 minutes prior to adding ME A to a final concentration of 0.1 mM. This was stirred 
slowly at RT for lhr. Cystamine and CuSO* were then added to final concentrations of 
ImM and L0u\M respectively. The solution was stirred slowly at RT for 10 minutes prior to 

15 

being moved to the 4°C cold room and shaken slowly overnight, open to the air. 
10 The following day, the solution (still clear, with no apparent precipitation) was 

centrifuged at 100,000 x g for 1 hour. Supernatants from multiple runs were pooled, and 
20 the bulk of the stabilized protein was dialyzed against 20mM NaP, pH 7.4, 150tnM NaCl, 

20% glycerol. After dialysis, the material was stored at -20°C. 

Some (about 10ml) of the protein solution (still in 1M Urea) was saved back for 
15 biochemical analyses, and frozen at -20 C C for storage. 

25 

Example 10. Expression of Hu-Asp2 and Derivatives in Insect Cells 

Expression by baculovirus infection — The coding sequence of Hu-Asp2 and several 
derivatives were engineered for expression in insect cells using the PCR. For the full- 
30 length sequence, a 5' -sense oligonucleotide primer that modified the translation initiation 

20 site to fit the Kozak consensus sequence was paired with a 3'-antisense primer that contains 
the natural translation termination codon in the Hu-Asp2 sequence. PCR amplification of 
the pcDNA3.1(hygro)/Hu-Asp2 template (sec Example 12). Two derivatives of Hu-Asp2 

35 

that delete the C -terminal transmembrane domain (SEQ ID No. 29 and No. 30) or delete the 
transmembrane domain and introduce a hexa-histidine tag at the C-termiuus (SEQ ID No. 
25 31 and No. 32) were also engineered using the PCR. The same 5 '-sense oligonucleotide 
40 primer described above was paired with either a 3 , -antisense primer that (1) introduced a 

translation termination codon after codon 453 (SEQ ID No. 3) or (2) incorporated a hexa- 
histidine tag followed by a translation termination codon in the PCR using 
pcDNA3.1(hygro)/IIu_Asp-2L as the template. In all cases, the PCR reactions were 

45 

30 performed amplified for 15 cycles using Pwol DNA polymerase (Boetiringer-Mannheim) as 
outlined by the supplier. The reaction products were digested to completion with BamH\ 
and Notl and ligated to BamHX and Nod digested baculovirus transfer vector pVL1393 
50 (Invitrogen). A portion of the ligations was used to transform competent E. coli DH5a cells 
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followed by antibiotic selection on LB- Amp. Plasmid DNA was prepared by standard 
alkaline lysis and banding in CsCl to yield the baculovirus transfer vectors pVL1393/Asp2, 
pVL1393/Asp2ATM and pVL1393/Asp2ATM(His) 6 . Creation of recombinant 
baculoviruses and infection of sf9 insect cells was performed using standard methods. 
5 Expression by transfection — Transient and stable expression of Hu-Asp2ATM and 

Hu-Asp2ATM(His) 6 in High 5 insect cells was performed using the insect expression vector 
pE/V5-His. The DNA inserts from the expression plasmids vectors pVL1393/Asp2, 
pVL1393/Asp2ATM and pVL1393/Asp2ATM(His) 6 were excised by double digestion with 
BamHl and Nod and subcloned into BamHl and Not\ digested pIZ/V5-His using standard 
H) methods. The resulting expression plasmids, referred to as pIZ/Hu-Asp2ATM and pIZ/Hu- 
Asp2ATM(His)6, were prepared as described above. 
20 For transfection. High 5 insect cells were cultured in High Five serum free medium 

supplemented with 10 jig/ml gentamycin at 27°C in sealed flasks. Transfections were 
performed using High five cells, High five serum free media supplemented with 10 |ig/ml 
15 gentamycin, and InsectinPlus liposomes (Invitrogen, Carlsbad, CA) using standard 
methods. 

For large scale transient transfections 1.2 x 10 7 high five cells were plated in a 150 
mm tissue culture dish and allowed to attach at room temperature for 15-30 minutes. 
30 During the attachment time the DNA/ liposome mixture was prepared by mixing 6 ml of 

20 serum free media, 60 pig Asp2ATM/pIZ (+/- His) DNA and 120 ul of Insectin Plus and 
incubating at room temperature for 15 minutes. The plating media was removed from the 
dish of cells and replaced with the DNA/liposome mixture for 4 hours at room temperature 
35 with constant rocking at 2 rpm. An additional 6 ml of media was added to the dish prior to 

incubation for 4 days at 27 °C in a humid incubator. Four days post transfection the media 
25 was harvested, clarified by centrifugation at 500 x g, assayed for Asp2 expression by 
40 Western blotting. For stable expression, the cells were treated with 50 M-g/ml Zeocin and 

the surviving pool used to prepared clonal cells by limiting dilution followed by analysis of 
the expression level as noted above. 

Purification of Hu-Asp2AT\f and Hu-Asp2ATM(His)$ — Removal of the 
30 transmembrane segment from Hu-Asp2 resulted in the secretion of the polypeptide into the 
culture medium. Following protein production by either baculovirus infection or 
transfection, the conditioned medium was harvested, clarified by centrifugation, and 
50 dialyzed against Tris-HCl (pH 8.0). This material was then purified by sucessive 
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chromatography by anion exchange (Tris-HCl, pH 8.0) followed by cation exchange 
5 chromatography (Acetate buffer at pH 4.5) using NaCl gradients. The elution profile was 

monitored by (1) Western blot analysis and (2) by activity assay using the peptide substrate 

described in-Example 12. For the Hu-Asp2ATM(His) & , the conditioned medium was 
10 5 dialyzed against Tris buffer (pH 8.0) and purified by sequential chromatography on IM AC 

resin followed by anion exchange chromatography. 
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Sequence analysis of the purified Hu-Asp2ATM(His) 6 protein revealed that the signal 
peptide had been cleaved [TQHGIRLPLR]. 

10 

Example II. Expression of Hu-Asp2 in CITO cells 

Heterologous expression ofHu_AspH in CHO-Kl cells— The entire coding sequence of 
Hu-Asp2 was cloned into the mammalian expression vector pcDNA3.1(+)Hygro 
15 (Invitrogen, Carlsbad, CA) which contains the CMV immediate early promotor and bGH 
polyadenylation signal to drive over expression. The expression plasmid, 
pcDNA3. K+)Hygro/Hu-Asp2, was prepared by alkaline lysis and banding in CsCl and 
completely sequenced on both strands to verify the integrity of the coding sequence. 



20 Wild-type Chinese hamster ovary cells (CHO-Kl) were obtained from the ATCC. The 

cells were maintained in monolayer cultures in a-MEM containing 10% FCS at 37°C in 5% 
35 C0 2 . Two 100 mm dishes of CHO-Kl cells (60% confluent) were transfecled with 

pcDNA3.1(+)/Hygro alone (mock) or pcDNA3. l(+)Hygro/Hu-Asp2 using the cationic 
liposome DOTAP as recommended by the supplier. The cells were treated with the plasmid 

40 

25 DNA/liposome mixtures for 15 hr and then the medium replaced with growth medium 
containing 500 Units/ml hygromyciti B. In the case of pcDNA3.1(+)Hygro/Hu-Asp2 
45 transfected CHO-Klcells, individual hygromycin B-resistant cells were cloned by limiting 

dilution. Following clonal expansion of the individual cell lines, expression of Hu-Asp2 
protein was accessed by Western blot analysis using a polyclonal rabbit antiserum raised 
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against recombinant Hu-Asp2 prepared by expression in E. coll. Near confluent dishes of 
each cell line were harvested by scraping into PBS and the cells recovered by 
centrifugation. The cell pellets were resuspended in cold lysis buffer (25 rru\f Tris-HCl 
(8.0)75 mM EDTA) containing protease inhibitors and the cells lysed by sonicahon. The 
5 soluble and membrane fractions were separated by centrifugation (105,000 x g, 60 min) and 
normalized amounls of protein from each fraction were then separated by SDS-PAGE. 
Following electrotransfer of the separated polypeptides to PVDF membranes, Hu_Asp-2L 
protein was detected using rabbit anti-IIu-Asp2 antiserum ( 1/1000 dilution) and the 
antibody-antigen complexes were visualized using alkaline phosphatase conjugated goat 
10 anti-rabbit antibodies (1/2500). A specific immunoreactive protein with an apparent Mr 
value of 65 kDa was detected in pcDNA3.1(+)Hygro/Hu-Asp2 transfected cells and not 
25 mock-transfected cells. Also, the Hu-Asp2 polypeptide was only detected in the membrane 

fraction, consistent with the presence of a signal peptide and single transmembrane domain 
in the predicted sequence. Based on this analysis, clone #5 had the highest expression level 
15 of Hu-Asp2 protein and this production cell lines was scaled up to provide material for 
purification. 

Purification of recombinant Hu_Aap-2L from CHO-Kl/HuAsp2 clone #5 — In a 
typical purification, clone #5 cell pellets derived from 20 150 mm dishes of confluent cells, 
were used as the starting material. The cell pellets were resuspended in 50 ml cold lysis 
40 20 buffer as described above. The cells were lysed by polytron homogenization (2 x 20 sec) 

and the lysatc centrifuged at 338,000 x g for 20 minutes. The membrane pellet was then 
resuspended in 20 ml of cold lysis buffer containing 50 mM (3-octylglucoside followed by 
rocking at 4 U C for Ihr. The detergent extract was clarified by centrifugation at 338,000 x g 
for 20 minutes and the supernatant taken for further analysis. 
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The p-octylglucoside extract was applied to a Mono Q anion exchange column that 
was previously equilibrated with 25 mM Tris-HCl (pH 8.0)/50 mM p-octylglucoside. 
Following sample application, the column was eluted with a linear gradient of increasing 
NaCl concentration (0-1.0 M over 30 minutes) and individual fractions assayed by Western 
blot analysis and for P-secretase activity (see below). Fractions containing both Hu_Asp- 
2L immunoreactivity and p-secretase activity were pooled and dialyzed against 25 mM 
NaOAc (pH 4.5)/50 mM p-octylglucoside. Following dialysis, precipitated material was 
removed by centrifugation and the soluble material chromatographed on a MonoS cation 
exchange column that was previously equilibrated in 25 mM NaOAc (pH 4.5)/ 50 mM (3- 
octylglucosidc. The column was clutcd using a linear gradient of increasing NaCl 
concentration (0-1.0 M over 30 minutes) and individual fractions assayed by Western blot 
analysis and for p-secretase activity. Fractions containing both Hu-Asp2 immunoreactivity 
and p-secretase activity were combined and determined to be >90% pure by SDS- 
PAGE/Coomassie Blue staining. 

Example 12, Assay of Hu-Asp2 P-secretase activity using peptide substrates 
P-secretase assay — P-secretase activity was measured by quantifying the hydrolysis of a 
synthetic peptide containing the APP Swedish mutation by RP-HPLC with UV detection. 
Each reaction contained 50 mM Na-MES (pH 5.5), 1% P-octylglucoside, peptide substrate 
(SEVNLDAEFR, 70 ^iM) and enzyme ( i-5 \xg protein). Reactions were incubated at 37 °C 
for various times and the reaction products were resolved by RP-HPLC using a linear 
gradient from 0-70 B over 30 minutes (A=0. 1 % TFA in water, 

B=).l%TFA/10%water/90%AcCN). The elution profile was monitored by absorbance at 
214 nm. In preliminary experiments, the two product peaks which eluted before the intact 
peptide substrate, were confirmed to have the sequence DAEFR and SEVNL using both 
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Edman sequencing and MADLI-TOF mass spectrometry. Percent hydrolysis of the peptide 

5 

substrate was calculated by comparing ihe integrated peak areas for the two product 
peptides and the starting material derived from the absorbance at 2 14 nm. The specificity 
10 of the protease cleavage reaction was determined by performing the (3-secretase assay in the 

5 presence of a cocktail of protease inhibitors (8 [xM pepstatin A, 10 leupeptin, 10 (J.M 
H64, and5mM EDTA). 

15 

An alternative p-secretase assay utilizes internally quenched fluorescent substrates 
to monitor enzyme activity using fluorescence spectroscopy in a single sample or multiweli 
format. Each reaction contained 50 mM Na-MES (pH 5.5). peptide substrate MCA- 
20 to EVKMDAEF[K-DNP] (BioSource International) (50 uM) and purified Hu-Asp-2 enzyme. 

These components were equilibrated to 37 °C for various times and the reaction initiated by 
addition of substrate. Excitation was performed at 330 nm and the reaction kinetics were 
monitored by measuring the fluorescence emission at 390 nm. To detect compounds that 
modulate Hu-Asp-2 activity, the test compounds were added during the preincubation phase 
15 of the reaction and the kinetics of the reaction monitored as described above. Activators are 
scored as compounds that increase the rate of appearance of fluorescence while inhibitors 
30 decrease the rate of appearance of fluorescence. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. 
20 Numerous modifications and variations of the present invention are possible in light of the 
above teachings and, therefore, are within the scope of the invention. 
The entire disclosure of all publications cited herein are hereby incorporated by reference. 
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What is claimed is: 

1 . Any isolated or purified nucleic acid polynucleotide that codes for a protease 

capable of cleaving the beta (p) secretase cleavage site of APP that contains two or 
5 more sets of special nucleic acids, where the special nucleic acids are separated by 

nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any amino acids, where the first set of special 
nucleic acids consists of the nucleic acids that code for the peptide DTG, where the 
first nucleic acid of the first special set of nucleic acids is, the first special nucleic 
10 acid, and where the second set of nucleic acids code for either the peptide DSG or 

DTG, where the last nucleic acid of the second set of nucleic acids is the last special 
2Q nucleic acid, with the proviso that the nucleic acids disclosed in SEQ ID NO. 1 and 

SEQ. ID NO. 5 are not included. 
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15 2. The nucleic acid polynucleotide of claim I where the two sets of nucleic acids are 
separated by nucleic acids that code for about 125 to 222 amino acid positions, 
which may be any amino acids. 

3. The nucleic acid polynucleotide of claim 2 that code for about 150 to 172 amino 
20 acid positions, which may be any amino acids. 

4. The nucleic acid polynucleotide of claim that code for about 172 amino acid 
positions, which may be any amino acids. 



25 5. The nucleic acid polynucleotide of claim 4 where the nucleotides are described in 
40 SEQ. ID. NO. 3 

6. The nucleic acid polynucleotide of claim 2 where the two sets of nucleic acids are 
separated by nucleic acids that code for about 150 to 196 amino acid positions. 



30 



The nucleic acid polynucleotide of claim 6 where the two sets of nucleotides are 
separated by nucleic acids that code for about 196 amino acids (positions). 
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8. The nucleic acid polynucleotide of claim 7 where the two sets of nucleic acids arc 
separated by the same nucleic acid sequences that separate the same set of special 
nucleic acids in SEQ. ID. NO. 5. 

9. The nucleic acid polynucleotide of claim 4 where the two sets of nucleic acids are 
separated by nucleic acids that code for about 150 to 190, amino acid (positions). 

10. The nucleic acid polynucleotide of claim 9 where the two sets of nucleotides are 
separated by nucleic acids that code for about 190 amino acids (positions). 

1 1 . The nucleic acid polynucleotide of claim 10 where the two sets of nucleotides are 
separated by the same nucleic acid sequences that separate the same set of special 
nucleotides in SEQ. ID. NO. 1. 

12. Claims l-l 1 where the first nucleic acid of the first special set of amino acids, that 
is, the first special nucleic acid, is operably linked to any codon where the nuclic 
acids of that codon codes for any peptide comprising from 1 to 10,000 amino acid 
(positions). 

13. The nucleic acid polynucleotide of claims 1-12 where the first special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: any any reporter proteins or proteins which facilitate 
purification. 

14. The nucleic acid polynucleotide of claims 1-13 where the first special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immunoglob in-heavy chain, maltose binding protein, glutathion 
S transfection, Green Fluorescent protein, and ubiquitin. 

15. Claims 1-14 where the last nucleic acid of the second set of special amino acids, that 
is, the last special nucleic acid, is operably linked to nucleic acid polymers that code 
for any peptide comprising any amino acids from 1 to 10,000 amino acids. 
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16. Claims 1-15 where the last special nucleic acid is operably linked to any codon 
5 linked to nucleic acid polymers that code for any peptide selected from the group 

consisting of: any reporter proteins or proteins which facilitate purification. 

10 5 17. The nucleic acid polynucleotide of claims 1-16 where the first special nucleic acid is 

operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immunoglobin-heavy chain, maltose binding protein, glutathion 
S transaction, Green Fluorescent protein, and ubiquitin. 

15 

10 18. * Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta secretasc cleavage site of APP that contains two or 
20 more sets of special nucleic acids, where the special nucleic acids are separated by 

nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any amino acids, where the first set of special 
15 nucleic acids consists of the nucleic acids that code for DTG, where the first nucleic 

25 

acid of the first special set of nucleic acids is, the first special nucleic acid t and 
where the second set of nucleic acids code for either DSG or DTG, where the last 
nucleic acid of the second set of special nucleic acids is the last special nucleic acid, 
30 where the First special nucleic acid is operably linked to nucleic acids that code for 

20 any number of amino acids from zero to 8 1 amino acids and where each of those 

codons may code for any amino acid. 

35 

19. The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
operably linked to nucleic acids that code for any number of from 64 to 77 amino 

25 acids where each codon may code for any amino acid. 

40 

20. The nucleic acid polynucleotide of claim 19 , where the first special nucleic acid is 
operably linked to nucleic acids that code for about 71 amino acids peptide. 

45 

30 21. The nucleic acid polynucleotide of claim 20, where the first special nucleic acid is 

operably linked to 71 amino acid peptide and where the first of those 71 amino acids 
is the amino acid T. 
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22. The nucleic acid polynucleotide of claim 21, where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in 
SEQ. ID. NO. 3, that is, identical to the sequences in SEQ. ID. NO. 3 including the 
sequences from both the first and or the second special nucleic acids, toward the N- 
Terminal, through and including 71 amino acids, see Example 10, beginning from 
the DTG site and including the nucleotides from that code for 7 1 amino acids). 

23. The nucleic acid polynucleotide of claim 22, where the complete polynucleotide 
comprises identical to the same corresponding amino acids in SEQ. ID. NO. 3, that 
is, identical to the sequences in SEQ. ID. NO. 3 including the sequences from both 
the first and or the second special nucleic acids, toward the N-Terminal, through 
and including 71 amino acids, sec Example 10, beginning from the DTG site and 
including the nucleotides from that code for 71 amino acids). 

24. The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
operably linked to nucleic acids that code for any number of from about 30 to 54 
amino acids where each codon may code for any amino acid. 

25. The nucleic acid polynucleotide of claim 20, where the first special nucleic acid is 
operably linked to 47 codons where the first those 35 or 47 amino acids is the 
amino acid E or G. 

26. The nucleic acid polynucleotide of claim 2 1 . where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in 
SEQ. ID. NO. 3, that is, identical to that portion of the sequences in SEQ. ID. NO. 
3 including the sequences from both the first and or the second special nucleic 
acids, toward the N-Terminal, through and including 35 or 47 amino acids, see 
Example 1 1 for the 47 example, beginning from the DTG site and including the 
nucleotides from that code for the previous 35 or 47 amino acids before the DTG 
site). 
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27. The nucleic acid polynucleotide of claim 22, where the polynucleotide comprises 
identical to the same corresponding amino acids in SEQ. ID. NO- 3, that is, 
identical to the sequences in SEQ. ID. NO. 3 including the sequences from both the 
first -and or the second special nucleic acids, toward the N-Terminal, through and 
including 35 or 47 amino acids, see Example 1 1 for the 47 example, beginning from 
the DTG site and including the nucleotides from that code for the previous 35 or 47 
amino acids before the DTG site). 

28. * Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta (P) secretase cleavage site of APP that contains two or 
more sets of special nucleic acids, where the special nucleic acids are separated by 
nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any amino acids, where the first set of special 
nucleic acids consists of the nucleic acids that code for the peptide DTG, where the 
first nucleic acid of the first special set of amino acids is, the first special nucleic 
acid, and where the second set of special nucleic acids code for either the peptide 
DSG or DTG, where the last nucleic acid of the second set of special nucleic acids, 
the last special nucleic acid, is operably linked to nucleic acids that code for any 
number of codons from 50 to 170 codons. 

29. The nucleic acid polynucleotide of claim 29 where the last special nucleic acid is 
operably linked to nucleic acids comprising from 100 to 170 codons. 

30. The nucleic acid polynucleotide of claim 30 where the last special nucleic acid is 
operably linked to nucleic acids comprising from 142 to 163 codons. 

3 1 . The nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
operably linked to nucleic acids comprising about 142 codons. 

32. The nucleic acid polynucleotide of claim 32 where the polynucleotide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 
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33. The nucleic acid polynucleotide of claim 33, where the complete polynucleotide 
comprises SEQ ID. # (Example 9 or 10). 

34. The nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
5 operably linked to nucleic acids comprising about 163 codons. 

35. The nucleic acid polynucleotide of claim 35 where the polynucleotide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 

10 36. The nucleic acid polynucleotide of claim 36, where the complete polynucleotide 
comprises SEQ. ID. # (Example 9 or 10). 

37. The nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
operably linked to nucleic acids comprising about 170 codons. 

15 

38. Claims 1-38 where the second set of special nucleid acids code for the peptide DSG, 
and optionally the first set of nucleic acid polynucleotide is operably linked to a 
peptide purification tag. 

20 39. Claims 1-39 where the nucleic acid polynucleotide is operably linked to a peptide 
purification tag which is six histidine. 

40. Claims 1-40 where the first set of special nucleic acids are on one polynucleotide 
and the second set of special nucleic acids are on a second polynucleotide, where 

25 both first and second polynucleotides have at lease 50 codons. 

41. Claims 1-40 where the first set of special nucleic acids are on one polynucleotide 
and the second set of special nucleic acids are on a second polynucleotide, where 
both first and second polynucleotides have at lease 50 codons where both said 

30 polynucleotides are in the same solution. 

42. A vector which contains a polynucleotide described in claims 1-42. 
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43. A cell or cell line which contans a polynucleotide described in claims 1-42. 



10 



15 



44. Any isolated or purified peptide or protein comprising an amino acid polymer mat is 
a protease capable of cleaving the beta (0) secrctase cleavage site of APP that 

5 contains two or more sets of special amino acids, where the special amino acids are 

separated by about 100 to 300 amino acid positions, where each amino acid 
position can be any amino acid, where the first set of special amino acids consists of 
the peptide DTG, where the first amino acid of the first special set of amino acids is, 
the first special amino acid, where the second set of amino acids is selected from the 
10 peptide comprising either DSG or DTG, where the hist amino acid of the second set 

of special amino acids is the last special amino acid, with the proviso that the 
20 proteases disclosed in SEQ ID NO. 2 and SEQ. ID NO. 6 are not included. 

45. The amino acid polypeptide of claim 45 where the two sets of amino acids are 

15 separated by about 125 to 222 amino acid positions where in each position it may be 

any amino acid. 



25 



46. The amino acid polypeptide of claim 46 where the two sets of amino acids are 
30 separated by about 150 to 172 amino acids. 

20 

47. The amino acid polypeptide of claim 47 where the two sets of amino acids are 
separated by about 172 amino acids. 

48. The amino acid polypeptide of claim 48 where the protease is described in SEQ. ID. 
25 NO. 4 

49. The ammo acid polypeptide of claim 46 where the two sets of amino acids are 
separated by about 150 to 196 amino acids. 

30 50. The amino acid polypeptide of claim 50 where the two sets of amino acids are 
separated by about 196 amino acids. 
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5 1 . The amino acid polypeptide of claim 5 1 where the two sets of amino acids are 
separated by the same amino acid sequences that separate the same set of special 
amino acids in SEQ. ID. NO. 6. 

52. The amino acid polypeptide of claim 46 where the two sets of amino acids are 
separated by about 1 50 to 190, amino acids. 

53. The amino acid polypeptide of claim 53 where the two sets of nucleotides are 
separated by about 190 amino acids. 

54. The amino acid polypeptide of claim 54 where the two sets of nucleotides are 
separated by the same amino acid sequences that separate the same set of special 
amino acids in SEQ. ID. NO. 2. 

55. Claims 45-55 where the first amino acid of the first special set of amino acids, that 
is, the first special amino acid, is operably linked to any peptide comprising from 1 
to 10,000 amino acids 

56. The amino acid polypeptide of claims 45-56 where the first special amino acid is 
operably linked to any peptide selected from the group consisting of: any any 
reporter proteins or proteins which facilitate purification. 

57. The amino acid polypeptide of claims 45-57 where the first special amino acid is 
operably linked to any peptide selected from the group consisting of: 
immunoglobin-heavy chain, maltose binding protein, giutathion S transfection, 
Green Fluorescent protein, and ubiquitin. 

58. Claims 45-58, where the last amino acid of the second set of special amino acids, 
that is, the last special amino acid, is operably linked to any peptide comprising any 
amino acids from 1 to 10.000 amino acids. 
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59. Claims 45-59 where the last special amino acid is operably linked any peptide 
selected from the group consisting of any reporter proteins or proteins which 
facilitate purification. 

5 60. The amino acid polypeptide of claims 45-60 where the first special amino acid is 
operably linked to any peptide selected from the group consisting of: 
immunoglob in-heavy chain, maltose binding protein, glutathion S transaction, 
Green Fluorescent protein, and ubiquitin. 

10 61. * Any isolated or purified peptide or protein comprising an amino acid 

polypeptide that codes for a protease capable of cleaving the beta secretase cleavage 
site of APP that contains two or more sets of special amino acids, where the special 
amino acids are separated by about 100 to 300 amino acid positions, where each 
amino acid in each position can be any amino acid, where the first set of special 
15 amino acids consists of the amino acids DTG, where the first amino acid of the first 

special set of amino acids is, the first special amino acid, D, and where the second 
set of amino acids is either DSG or DTG, where the last amino acid of the second 
set of special amino acids is the last special amino acid, G, where the first special 
amino acid is operably linked to amino acids that code for any number of amino 
20 acids from zero to 8 1 amino acid positions where in each position it may be any 

amino acid. 

62. The amino acid polypeptide of claim 62, where the first special amino acid is 
operably linked to a peptide from about 30 to 77 amino acids positions where each 

25 amino acid position may be any amino acid. 

63. The amino acid polypeptide of claim 63, where the first special amino acid is 
operably linked to a peptide of 35, 47, 71, or 77 amino acids. 

45 30 64. The amino acid polypeptide of claim 63, where the first special amino acid is 

operably linked to the same corresponding peptides from SEQ. ID. NO. 3 that are 
35, 47, 71, or 77 peptides in length, beginning counting with the amino acids on the 
first special sequence, DTG, towards the N-terminal of SEQ. ID. NO. 3. 
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65. The amino acid polypeptide of claim 65, where the polypeptide comprises a 

sequence that is at least 95% identical to the same corresponding amino acids in 
SEQ. ID. NO. 4, that is, identical to that portion of the sequences in SEQ.ID. NO. 4, 
10 5 including all the sequences from both the first and or the second special nucleic 

acids, toward the N- terminal, through and including 71, 47, 35 amino acids before 
the first special amino acids. (Examples 10 and 1 1). 



15 



20 



25 



66. The amino acid polypeptide of claim 65, where the complete polypeptide comprises 
10 the peptide of 71 amino acids, where the first of the amino acid is T and the second 

isQ. 

67. The amino acid polypeptide of claim 62, where the first special amino acid is 
operably linked to any number of from 40 to 54 amino acids (positions) where each 

15 amino acid position may be any amino acid. 



68. The amino acid polypeptide of claim 68, where the iirst special amino acid is 
operably linked to amino acids that code for a peptide of 47 amino acids. 

30 

20 69. The amino acid polypeptide of claim 69, where the first special amino acid is 

operably linked to a 47 amino acid peptide where the first those 47 amino acids is 
the amino acid E, 

35 

70. The amino acid polypeptide of claim 70, where the polypeptide comprises a 
25 sequence that is at least 95% identical to SEQ. ID. # (Example 10). 

40 

71. The amino acid polypeptide of claim 71, where the complete polypeptide comprises 
SEQ. ID. # (Example 10). 

45 

30 72. * Any isolated or purified amino acid polypeptide that is a protease capable of 
cleaving the beta (fS) secretase cleavage site of APP that contains two or more sets 
of special amino acids, where the special amino acids are separated by about 100 to 
50 300 amino acid positions, where each amino acid in each position can be any amino 
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acid, where the first set of special amino acids consists of the amino acids that code 
for DTG, where the first amino acid of the first special set of amino acids is, the 
first special amino acid, D, and where the second set of amino acids are either DSG 
or DTG, where the last amino acid of the second set of special amino acids is the 
last special amino acid, G, which is operably linked to any number of amino acids 
from 50 to 170 amino acids, which may be any amino acids. 

73. The amino acid polypeptide of claim 73 where the last special amino acid is 
operably linked to a peptide of about 100 to 170 ammo acids. 



10 



74. The amino acid polypeptide of claim 74 where the last special amino acid is 
20 operably linked to to a peptide of about 142 to 163 amino acids. 

75. The amino acid polypeptide of claim 75 where the last special amino acid is 
15 operably linked to to a peptide of about about 142 amino acids. 



76. The amino acid polypeptide of claim 76 where the polypeptide comprises a 
sequence that is at least 95% identical to SEQ. ED. # (Example 9 or 10). 

20 77. The amino acid polypeptide of claim 75 where the last special amino acid is 
operably linked to a peptide of about 163 amino acids. 



78. The amino acid polypeptide of claim 79 where the polypeptide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 

25 

40 79. The amino acid polypeptide of claim 79, where the complete polypeptide comprises 

SEQ. ID. # (Example 9 or 10). 



80. The amino acid polypeptide of claim 74 where the last special amino acid is 
30 operably linked to to a peptide of about 170 amino acids. 



8 1 . Claim 46-8 1 where the second set of special amino acids is comprised of the peptide 
50 with the amino acid sequence DSG. 
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82. Claims 45-82 where the amino acid polypeptide is operably linked to a peptide 
purification tag. 

5 83. Claims 45-83 where the amino acid polypeptide is operably linked to a peptide 
purification tag which is six histidine. 

84. Claims 45-84 where the first set of special amino acids are on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both first 

10 and second polypeptide have at lease 50 amino acids, which may be any amino 

acids. 

85. Claims 45-84 where the first set of special amino acids are on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both first 

15 and second polypeptides have at lease 50 amino acids where both said polypeptides 

are in the same vessel. 

86. A vector which contains a polypeptide described in claims 45-86. 
20 87. A cell or cell line which contans a polynucleotide described in claims 45-87. 

88. The process of making any of the polynucleotides, vectors, or cells of claims 1-44 

89. The process of making any of the polypeptides, vectors' or cells of claims 45-88 

25 

40 90. Any of the polynucleotides, polypeptides, vectors, cells or cell lines described in 

claims 1-88 made from the processes described in claims 89 and 90. 
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30 91. * An isolated nucleic acid molecule comprising a polynucleotide, said 

polynucleotide encoding a Hu-Asp polypeptide and having a nucleotide sequence at least 
95% identical to a sequence selected from the group consisting of: 

50 
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(a) a nucleotide sequence encoding a Hu-Asp polypeptide selected from 
the group consisting of Hu-Aspl, Hu-Asp2(a), and Hu-Asp2(b), wherein said Hu-Aspl, 
Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid sequence of SEQ 
ID NO:2, SEQ ID NO:4, and SEQ ID No:6, respectively; and 
5 (b) a nucleotide sequence complementary to the nucleotide sequence 

of (a). 

92. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
Aspl, and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of SEQ 

10 IDNO:l. 

93. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
Asp2(a), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQ ID NO:4. 

15 

25 94. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 

Asp2(b), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQ ID NO:5. 
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20 95. An isolated nucleic acid molecule comprising polynucleotide which hybridizes 

under stringent conditions to a polynucleotide having the nucleotide sequence in (a) or (b) 
of claim 92. 



25 



30 



96. A vector comprising the nucleic acid molecule of claim 96. 

97. The vector of claim 97, wherein said nucleic acid molecule is operably linked to a 
promoter for the expression of a Hu-Asp polypeptide. 

98. The vector of claim 97, wherein said Hu-Asp polypeptide is Hu-Aspl. 

99. The vector of claim 97, wherein said Hu-Asp polypeptide is Hu-Asp2(a). 

100. The vector of claim 97, wherein said Hu-Asp polypeptide is Hu-Asp2(b). 
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101. A host cell comprising the vector of claim 98. 

102. A method of obtaining a Hu-Asp polypeptide comprising culturing the host cell of 
5 claim 102 and isolating said Hu-Asp polypeptide. 

103. An isolated Hu-Asp 1 polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence comprising the amino acid sequence of SEQ ID NO;2. 

10 104. An isolated Hu-Asp2(a) polypeptide comprising an amino acid sequence at least 
95% identical to a sequence comprising the amino acid sequence of SEQ ID NO:4. 

105. An isolated Hu-Asp2(a) polypeptide comprising an amino acid sequence at least 
95% identical to a sequence comprising the amino acid sequence of SEQ ID NO: 8. 

15 

106 An isolated antibody that binds specifically to the Hu-Asp polypeptide of any of 
claims 104-107. 

sequence comprising the amino acid sequence of SEQ FD NO:8. 

20 107 An isolated antibody that binds specifically to the Hu-Asp polypeptide of any of 
claims 104-107. 

35 108. * A method to identify a cell that can be used to screen for inhibitors of 0 

secretase activity comprising: 
25 a) identifying a cell that expresses a protease capable of cleaving APP at the P 

secretase site, 
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comprising: 

i) collect the cells or the supernantent from the cells to be identified 

ii) measure the production of a critical peptide, where the critical 

30 peptide is selected from the group consisting of either the APP C- 

terminal peptide or soluble APP, 

iii) select the cells which produce the critical peptide. 
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109. The method of claim 108 where the cells are collected and the critical peptide is the 
5 APP C-terrrunal peptide created as a result of the P secretase cleavage. 

1 10. The method of claim 108 where the supemantent is collected and the critical peptide 
10 5 i s soluble APP where the soluble APP has a C-terminal created by P secretase cleavage. 

111. The method of claim 108 where the cells contain any of the nucleic acids or 
polypeptides of claims 1-86 and where the cells are shown to cleave the P secretase site of 
any peptide having the following peptide structure, P2, P 1 , P 1 \ P2\ where P2 is K or N, 

10 where PI is M or L, where PT is D, where P2' is A. 

20 112. The method of claim ill where P2 is K and PI is M. 

1 1 3 The method of claim 1 12 where P2 is N and PI is L. 
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15 

1 14 * Any bacterial cell comprising any nucleic acids or peptides in claims 1-86 
and 92-107. 



30 1 15 A bacterial cell of claim 1 14 where the bacteria is Ecoii. 

20 

1 16 Any eukaryotic cell comprising any nucleic acids or polypeptides in claims 1 -86 and 
92-107. 



1 17 * Any insect cell comprising any of the nucleic acids or polypeptides in claims 
25 1-86 and 92-107. 

118 A insect cell of claim 1 17 where the insect is sfi?, or High 5. 

1 19 A insect cell of claim 100 where the insect ceil is High 5. 

30 

120 A mammalian cell comprising any of the nucleic acids or polypeptides in claims 1- 
86 and 92-107. 
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121 A mammalian cell of claim 120 where the mammalian cell is selected from the 
group consisting of, human, rodent, lagomorph, and primate. 

122 A mammalian cell of claim 121 where the mammalian cell is selected from the 
5 group consisting of human cell. 

123 A mammalian cell of claim 122 where the human cell is selected from the group 
comprising HEK293, and IMR-32. 

10 124 A mammalian cell of claim 121 where the cell is a primate cell. 

125 A primate cell of claim 124 where the primate cell is a COS-7 ceil. 

126 A mammalian cell of claim 121 where cell is selected from a rodent cells. 

15 

127 A rodent cell of claim 126 selected from, CHO-K1, Neuro2A, 3T3 cells. 

128 A yeast cell of claim 115. 

20 1 29 An avian cell of claim 1 15. 

130, * Any isofonn of APP where the last two carboxy terminus amino acids of that 
isoform are both lysine residues. 

25 131 The isoform of APP from claim 130 comprising the isoform known as APP695 
modified so that its last two having two lysine residues as its last two carboxy terminus 
amino acids. 

132 The isoform of claim 131 comprising SEQ. ID. 16. 

30 

133 The isofonn variant of claim 1301 comprising SEQ. ID. NO. 18, and 20. 
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134 Any eukaryotic cell line, comprising nucleic acids or polypeptides of claim 130- 
133. 



135 Any cell line of claim 134 that is a mammaliam cell line (HEK293, Neuro2a, are 
preferred plus any others.) 

136 A method for identifying inhibitors of an enzyme that cleaves the beta secrctase 
cleavabe site of APP comprising: 

a) culturing cells in a culture medium under conditions in which the enzyme 
causes processing of APP and release of amyloid beta-peptide 
into the medium and causes the accumulation of CTF99 fragments of APP in cell 
lysates, 

b) exposing the cultured cells to a test compound; and specifically 
detennining whether the test compound inhibits the function of the enzyme by 
measuring the amount of amyloid beta-peptide released into the 

medium and or the amount of CTF99 fragments of APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid beta 
peptide present in the culture medium and diminution of CTF99 fragments of APP in cell 
lysates as Asp2 inhibitors. 

137 The method of claim 136 wherein the cultured cells are a human, rodent or insect 
cell line. 

138 The method of claim 137 wherein the human or rodent cell line exhibits p secretase 
activity in which processing of APP occurs with release of amyloid beta-peptide into the 
culture medium and accumulation of CTF99 in cell lysates. 

139. A method as in claim 138 wherein the human or rodent cell line treated with the 
antisense oligomers directed against the enzyme that exhibits p secretase activity, reduces 
release of soluble amyloid beta-peptide into the culture medium and accumulation of 
CTF99 in cell lysates. 
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140. A method for the identification of an agent that decreases the activity of a Hu-Asp 
polypeptide selected from the group consisting of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the 
method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
determined in the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has decreased the activity of said Hu-Asp polypeptide.. 

141 . The nucleic acids, peptides, proteins, vectors, cells and cell lines, and assays described 
herein. 
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FIGURE 1 (1) 

ATGGCCGCACTGGCC CGGGCGC 7GCTGCTGCCTCTGCTGGCCCAGTGGCTCCTGCGCGCC 
M G A L A K A L L L P L I, A Q W L L K A 

C C 'J C GGAGCT GXJCCCCCGC GCC CTTCAC GCTG C CC C TCCGGGTGGC C GCGGC C AC G AAC 
AP2LAPAPFTLPLRVAAATN 

CGCGTACTTCCGCCCACCCCGGGACCCGGGACCCC?GCCGAGCGCCACGCCGACGGCTTG 
RVVAPTPGPGTPAERHADGL 

GCGCTCGCCCTG3AGCCTGCCCTGGCGTCCCCCGCGGGCGCCGGCAACTTCTTGGCCATG 
ALALEPALASPAGAANFLAM 

GTAG AC A/iCCTGCAGGGGGACTCTGGCCGCGGCTACT AC CTGG AG ATGCTGATCGGG AC C 
VDNLQGDSGRGYYLEMLIGT 

CCC C CGC AG AAGC TAG AG A TTCTC G7TGACAC TGG AAG CAGTAAC TTTC-C C GTGG C AGGA 
P P Q K L Q I LVDTGS SNFAVAG 

ACCC CGC ACTCCTAC AT AGACACGTACTTTG AC AC AGAGAGGTCTAGC AC AT ACCGCTCC 
'1 P H £ Y IDTYFDTFFSSTYR £ 

AAGGGCTTTGACGTCACAGTGAAGTACACACAAGGAAGCTGGACGGGCTTCGTTGGGGAA 
KGFDVTVKYTQGSWTGFVGE 

GACCTCGTCACCATCCCCAAAGGCTTCAATACTTCTTTTCTTGTCAACATTGCCACTATT 
;j L V T I P K G F N T S V L V II I A T I 

TTTGAATC AG AGAATTTC TTTTTGC C TGG G ATT AAATG GAATGG AATA C TTGG C C TAGCT 
FESENFFLPGIKWKGILGLA 

TATGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGGTGACA 
YATLAKPSSSLETFFDSLVT 

CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATX3TGTGGAGCCGGCTTGCCCGTTGCT 
Q A M I P K V F S M v M C G A G L P V A 

GGATCTGGGACCAACC^AGGTAGTCTTCTCTTG^GTGGAATTCAACCAAGTT/TGTATAAA 
GJGTNGGSLVLGGIEPSLYK 

GGAG AC ATC TGGTATAC C CCTATTAAGGAAC-AGTGGTACTAC C A3ATAG AAATTC TG AAA 
G *J T W Y T P I K E E W Y Y Q I E T I, K 

TTGGAAA'r^GGAGGCCAAAGCCTTAATCTGGACTGCAGAGAGTATAACGCAGACAAGGCC 
LETGGQSLNLDCKEYNADKA 

ATCGTGGACAGTGGCACCACGC1GCTGCGCCTGCCCCAGAAGGTGTTTGATGCGGTGGTG 
T V D K G T T L L F L ? Q K V F D A V V 

G AAGC TGTGG CCCGCGC ATC TC TG ATT C C AG AATT CT C TG A TGGTTTC TG G A CTGG G TC C 
EAVAKASLIPEFSDGFWTGS 

CAGCTGGCGTGCTGGACGAATTCGGAAACACCTTG3TCTTACTTCCCTAAAATCTCCATC 
OLACWTNSETPWSYFPKISI 

TAC ^TGAGAGATGAGAACTCCAGCAGGTCATTCCGTATCACAAT'CCTGCCTCAGCTTTAC 
YLRDENSSRSFRITILPQLY 

ATTCAGCCCATGATGGGG<3CCGGCCT3AATTATGAATGTTACCGATTCGGCATTTCCCCA 
1 2PXM3AGLNYECYRFGISP 

TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGC7TCTACGTCATCTTCGAC 
STNALVIGATVMEGFYVIFD 

AGAGCCCAGAAGAGGGTGGGCTTCGCAGCGAGCCCCTGTGCAGAAATTGCAGGTGCTGCA 
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FIGURE 1 (2) 



R AQKRVGFAASPGAKT A G A A 

G [ :X3TCTCA^TTTCCGGGCC?TTCTCAArAGAGGATGTAGCCAGCAACTC70TCCCCGCT 
VEEISGPFSTEDVASNCVPA 

CAGTC'ITTGAGCGAGCCCATTTTGl'GGArTGTGTCCTATGCGCTCATGAGCGTCTGTGGA 
QSLSEPILWIVSYALM3VCG 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCTOCCGTTrGGaTGTCAGrGTGnrCCr 
AILLVLIVLLLLPFKCQRRP 

CGTGACCCTGAGGTCGTCAATGATGAGTCCTCTCTGGTCAGACATCGCTGGAAATGAATA 
RDPiVVMDESSLVRHRWK 

GCCAGGCCTGACCTCAAG^AACCATGAACTCAGCTATTAAGAAAATCACATTTCCAGGGC 
AGCAGCCGGGATCGATGGTGGCGCTTTCTCCTGTGCCCACCCGTCTTCAATCTCTGTTCT 
GCTCCCAGATGCCTTCTAGATTGAGTCT^T'T^^GATTCTTGATTTTCAAGC^TCAAA'rC 
C TC CCTAC TTC C AAGAAAAATAATTAAAAAAAAAACTT3ATTCTAAAC 2 AAAAAAAAAAA 
AAAA 
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FIGURE 2 (l) 



ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 

m a q a l pwlllwmgagvlpah 

ggcacc::agcacggcatccggctgc:ccctgcgcagcggcctggggggcgcccccctgggg 
gtqkg irlplrsglggaplg 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VSMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 
VGSPPQTLNILVDTGSSNFA 

GTGGGTX3CTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPHPFLHRYYQRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVSIPHGPNVTVRANI 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATGCTG 
AAITESDKFFINGSNWEGIL 

GGGClGGCCTATGCTGAGATTGCCAGGCTlTG'rGGTGCTGGCTTCCCCCTCAACCAGTCT 
GLAYAEIARLCGAGFPLNQS 

GAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACCACTCGCTGTAC 
EVLASVGGSMI IGGIDHSLY 

ACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTGATCATTGTG 
TGSLWYTPIRREWYYEVIIV 

CGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAGTACAACTATGACAAG 
RVEINGQDLKMDCKEYNYDK 

AGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTGCA 
SIVDSGTTNLRLPKKVFEAA 

GTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGA 
VKSIKAASSTEKFPDGFWLG 

GAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCA 
EQLVCWQAGTTPWNIFPVIS 

CTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAA 
LYLMGEVTNQ5FRITILPQQ 

TACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATC 
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FIGURE 2 (2) 



YLRPVEDVAT5QDDCYKFAI 

TCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTC 
SQSSTGTVMGAVIMEGFYVV 

TTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGCCATGTGCACG7.TGAG 
FDRARKRIGFAVSACHVHDE 

TTCAGGACGGCAGCGGTGGAA3GCCCTTTTGTCACCTTGGACATGGAA3ACTGTGGCTAC 
FRTAAVEGFFVTLDMEDCGY 

AACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTATGTCATGGCTGCCATC 
N Z PQTDESTLMTIAYVMAA1 

TGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGC 
CALFMLPLCLMVCQWRCLRC 

CTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAGTGAGGAGGCCCA 
LRQQHDDFADDISLLK 

TGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCACTTTGGTCACAAGTA 
GGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCCCCACCCACCAAATGC 
CTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAG 
G AAAC AG AAAAG AG AAG AAAG AAG C AC TCTG C TG G C GG G AAT AC TC TTGG TC AC C T C AAA 
TTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCACCATTCCT 
TT AAATT C TC C AAC C C AAAG TATTC TTC TTTTC TT AG TT TC AG AAG T A C TG G CAT C AC AC 
GCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAGAGACCAAGCTTGTTT 
C C C TG C T GG C C AAAG TC AG T AGG AG AG G ATG C AC AGTTTG C TAT TTGC TTT AG AG AC AG G 
GACTGrATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGAAAAAAAAAAAAA 
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FIGURE 3 (1) 



ATGGCCCAAGCCCTGCCCTGGCTTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 
MAQALPWLLLWMGAGVLPAH 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
GTQHGIRLPLRSGLGGAPLG 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 
VGSPPQTLNILVDTG3SKFA 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCA3CTGTCCAGCACA 
VGAAPHPFLHRYYQRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVSIPHGPNVTVRANI 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 
AAITESDKFFTNGSNWEGIL 

GGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCT 
GLAYAEIARPDDSLEPFFDS 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCrTC 
LVKQTHVPNLFSLQLCGAGF 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATOATCATTGGAGGTATC 
PLNQSEVLASVGGSMI IGG I 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCG3CGGGAGTGGTATTAT 
DHSLYTGSLWYTP IRREWYY 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 

KVIIVRVEINGQDLKMDCKE 

TACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 
YNYDKSIVDSGTTNLRLPKK 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 
V? EAAVKS I KAAS S TEKF PD 
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FIGURE 3 (2) 



GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 
G F WLG E QL V C W Q A G TT PWN I 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 
FPVISLYLMGEVTNQSFRIT 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCGAAGACGACTGT 
ILPQQYLRPVEDVATSQDDC 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 
YKFAI SQSST3TVMGAVIME 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCTGTCAGCGCTTGC 
G FYVVFDRARKR I G FAVSAC 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATXS 
HVHDEFRTAAVEGPFVTLDM 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTAT 
EDCGYNIPQTDESTLMTIAY 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGl'GG 
VMAAI CALFMLPLCLMVCQW 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTG 
RCLRCLRQQHDDFADBI SLL 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 
K 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 
C C ACC C AC C AAATG C C TCTG C C TTG ATG G AG AAGG AAAAG G C TGG C AAG GTG G G TT C C AG 
GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 
CTTGGTCACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCT 
TTG TCC AC CATTC C TTTAAATTC TC C AACCC AAAGTATTCTTCTT TTCTTAG TTTC AGAA 
GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 
AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 
TG C TTT AG AG AC AGG G ACTG T AT AAA C AAG C C T AAC AT TG G TGCAAAG ATTG C C T C TTG A 
AT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 4 



ATG-SCCCCAGCGCTGCACTGGCTCCTGCTArGGGTGGGCTCGGGAATGCTGCCTGCCCAS 

MAFALHWLLLWVGSGMLPAQ 

GGAACCCATCTCGGCATCCSGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGGGC 

GTHLG IRLPLiRSGLAGPPLG 

CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGGCCGGAGAGGCAGCTTT 

LP. LPRETDEE5EEPGRRGSF 

GTGG AGATGGTGG AC AACCTGAGGGGAAAGTC CCOCC AGGGCTAC TATGTGGAGATGAC C 

VEMVDNL-RGKSGQGYYVEMT 

GTAGGCAGCCCCCCACAGACGCTCAACATCCTGGTGGACACGGGCAGTAGTAACrrTGCA 

VGSPPQTZjNILVDTGS S N F A 

GTGGGGGCTGCCCCACACCCTTTCCTGCATCGCTACTACCAGAGGCAGCrGTCCAGCACA 

VGAAPKPFLHRYYQRCLSST 

TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGA^ 

Y R D L R K G V Y V P Y T Q 3 K W E G E 
CTGGGCACCGACCTCGTGAGCATCCCTCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVS I PHGPNVTVRANI 
G-TGCCATCACTGAATCGGACAAG'rTCTTCATCAATGGTTCCAACTGGGAGGGCATCCTA 
AAITESDXFFINGSNWEGIL 

G GG Z TGG C CT ATG C TG AG AT TG C C AGG C C C G A C G AC T C TTTGG AG C CCTT C TTTG AC TC C 
GLAYAE IARPDDSLEPFFDS 
CTCGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCTGGCTTC 
L V K Q T H I P N I r S L Q h C G A G F 
CCCCTCAACCAGACCGAGGCACTGGCCTCG3TGGGAGGGAGCA7GATCATTGGTGGTATC 
P L, N Q T E A L A S V G G S M I I G G I 
GAC^ACTCGCTATArACGGGrAGTCTrTGGTACACACCCATCCGGCGGGAGTGGTATTAT 
C HSLYTGSLWYTPIRP. EWYY 
GAAGTGATCATTGTACGTGTGGAAATCAATGGTCAAGATCTCAAGATGGACTGCAAGGAG 
EVI IVRVEINGQDLKMDCKE 
TACAACTACGACAAGAGCATTGTGGACAGTGGGACCACCAACCTTGGCTTGCCCAAGAAA 
YtJYDKS IVDSGTTNLRLPKK 
GTATTTGAA'^CrGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCGGGAT 

V F E A A V K S IKAASSTHKFPD 
GG C TTTTGG CT AGGG G AGC AG C TGG TG TG CTGG C AAG C AGG C AC G AC C C C TTGG AAC ATT 
GFWLGEQ LVCWQAGTTPWNI 
TTCCCAGTCATTTCACTTTAGCTCATGGGTGAAGTCACCAATCAGTCCTTCCGCATCACC 
FPVI SLYLMGEVTNQSFRIT 
ATCCTTCCTCAGCAATACCTACGGCCGGTGGAGGACGTGGCCACGTCCCAAGACGACTGT 
T LPQQYLRPVEDVAT3QDDC 
TACAAGTTCGCTGTCTCACAGTCATCCACGGGCACTGTTATGGGAGCCGTCATCATGGAA 
YKFAVSQ SSTOTVMSAVIME 
GG TTTC TATGTCGT C TTCG ATC G AG C C C G AAAG C G AATTGG C TT TG C TGTC AG C G C TTGC 
GFYVVFDRARKRIGFAVSAC 
CATGTGCACGATGAGTTCAGGACGGCGGCAGTGGAAGGTCCGTTTGTTACGGCAGACATG 
P-VHDEFRTAAVEGPFVTADK 
GAAGACTGTQGCTACAACATTCCCCAGACAGATGAGTCAACACTTATGACCATAGCCTAT 
EDCGYNI PQTDESTLMTIAY 
GTGATGGCGGCCATCTGCGCCCTCTTCATGTTGCCACTCTGCCTCATGGTATGTCAGTGG 
VMAAICALFMLrLCLMVCQW 
CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATGACTTTGCTGATGACATCTCCCTGCTC 
RCLRCLRHQHDDFADD I SLL 
AAG ? AAGG AGGC TC GT GGGC AG ATG ATG G AG ACG C C C C TGG AC C A C ATC TGGGTGG TTC C 
Y. 

C TTTGGT C A C ATGAG TTG GAG C TATGGATGGTAC C TGTGGCC AG AGC AC CTC AGGA C C C T 
CACCAACCTGCCAATGCTTCTGGCGTGACAGAACAGAGAAATCACGCAAGCTGGATTACA 
GGG Z TTG C A C CTGTAG G AC AC AGG AG AGGG AAGG AAG C AG CGTTCTGGTGGC AGGAATAT 
CCTTAGGCACCACAAACTTGAGTTGGAAATTTTGCTGCTTGAAGCTTCAGCCCTGACCCT 
CTGCCCAGCATCCTTrAGAGTCTCCAACCTAAAGTATTCTTTATGTCCTTCCAGAAGTAG 
TGG C G TC AT AC TC AGG C T AC CC GG C ATGTGTCC C TGTG GT AC C C TGG C AG AG AAA G GG C C 
AATCTC ATT CCCTGC TGG CC AAAG TC AGC AGAAGAAGGTGAAGTTTGCCAGTTGCTTTAG 
TGArAGGGACTGCAGACTCAAGCCTACACTGGTACAAAGACTGCGTCTTGAGATAAACAA 
GAA 
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FIGURE 5 



1 -MAQALPWLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEE 5 0 

II || IIIIM-MII II I 1 I I I I I I I I I I I i I I 1 I I I I 1 I I 
1 MAPALHWLLLWVGSGMLPACGTHLGIRLPLRSGLAGPPLGLRLPRETDEE 5 0 

51 PEEPGRRGSFVEI4VDNLRGKSGQGYYVEMTVGSPPQTLHILVDTGSSNFA 100 

IIIIIIIIMMIIIIIMMMIMMMIIIIIIIIIMIIIIIII! 

51 SEEPGRRGSFVEI-T/DNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 100 
101 VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 15 0 

I I [ 1 1 1 [ 1 1 i j 1 1 1 1 1 1 M 1 1 1 1 i i I ! 1 1 1 1 1 1 1 1 • 1 1 1 1 1 1 k 1 1 1 1 1 1 1 

101 VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVFYTQGKWEGELGTDLVSIPH 15 0 
151 GPMVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 

IMIIIMMIMMIIIIIII'IIIIIIIIIMIIIMMIIIMIII! 

151 GPMVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 
201 LVKCTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 250 

I I I 1 1 1 : 1 1 ^ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! IIIIMIMIIIIMMIMM: 

201 LVKQTKIPNIFSLQLCGAGFPLNQTEALASVGGSMIIGGIDHSLYTGSLW 250 
251 YTPIRFEVJYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 3 00 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 

251 YTPIRREVJYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 300 
301 VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 3 50 

MIIIIIIIMIIIIIIIIIIMMIHIIIIIIMIIIMMIIIIMI , n 

301 WEAAVKSIKAASSTEKFPIX3FWLGEQLVCWQAGTTPVWIFFVISLYLMG 3d0 
351 EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 4 00 

MlllllllillMMIIIIMIMIIIIIIIhlllllMMIIIIII _ 

351 EVTNQSFRITILP2QYLRPVEDVATSQDDCYKFAVSQSSTGTVMGAVIME 400 
401 GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 450 

lllllllillllllllllllllllllllMIMIIM IIIIMIMlll 

401 GFYWFDRARKRIGFAVSACIIVHDEFRTAAVEGPFVTADMEDCGYNIPQT 450 
451 DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRQQHDDFADDISLL 500 

i 1 1 f 1 1 1 ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 ! 1 Nil I INI II I rnn 

451 DESTLMTIAYVMAAICALFMLPLCLMVCQWRCLRCLRHQHDDFADDISLIi 500 

501 K 501 
I 

501 K 501 
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FIGURE 6 (1) 



ATGGCTAGCATGACTGGTGGACAGCAAATGGGTCGCGGATCCACCCAGCACGGCA r rCCGG 
MASMTGGQ3KGRGS T Q K G I R 

CTGCCCCTGCGCA:3CGGCCTGGTG3GCGCCCCCCTGGGGCTGCGGCTGCCCC3GGAGACC 
LPl.RSGLGGA?liGLRL?RET 

GACGAAGAGCCCGAGGAGCCCG.3CCGGAGGGGCA3CTTTGTGGA3ATGGTGGACAACCTG 
DEEPESPGRRGSFVEMVDNL 

AGGGGGAAGTCGGGGCAGGGCTACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACG 
RGKSGQGYYVEMTVGSPPQT 

CTCAACATCCTGGTGGATACAC-GCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCC 
LN I LVDTGSSNFAVGAAPHP 

TTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGC 
T L HRYYQBQL S STYRDLRK G 

G'rGTATGTGCCCTACA"CCAGGGCAAG'rGGGAAGGGGAGCTGGGCACCGACCTGGTAAGC 
VYVPYTQGKWEGSLGTDLVS 

ATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACAl"rGCTGCCATCACTGAATCAGAC 
I PHGPNVTVRANIAAITESD 

AAGTTCTTCACCAACGGCTCCAACTGGGAAGGCATCCrGGGGCTGGCCTATGCTGAGATT 
K F F INGSNWEGILGLAYAEI 

GCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTT 
ARPDCSLEPFFDSLVKQTHV 

CCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAGTG 
PNLFSLQLCGAGFPLNQSEV 

CTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACCAGTCGCTGTACACAGGC 
LASVGGSKIIGGIDHSLYTG 

AGTCTCTCGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTCATCA'TTGTGCGGGTG 
SLWYTPIr. REWYYEVI IVRV 

GAGATCAATGGACAGGATCTGAAAATGGAC'TGCAAGGAGTACAACTATGACAAGAGCATT 
EINGQDLiKXDCKEYNYDKS I 

G^GGACAGTGGCACCACCAACCTTCCTTTCCCCAACAAAGTGTTTGAAGCTGCAGTCAAA 
V D SGTTNLR l> PK K V F E A A V K 

T C C AT C AAG G C AG CCTCCTCCACG G AG AAG TT C C C TG ATGG ?TT CTGG CTAGG AG A 3 C AG 
S IKAASSTEK7PDGFWLGEQ 

CTGGTGTGCTGGCAAGCA3GCACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTAC 
LVCWQAGTTPWNIFPVI SLY 

CTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACCATCCT1CCGCAGCAATACCTG 
LMGEVTNQSFRITILPQQYL 

CGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACA/vGTTTGCCATCTCACAG 
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FIGURE € (2) 



RPVEDVATSQEDCYKFAISQ 

tcatccacgggcac'rgitatgggagctgttatca'rjgagggcttctacgttgtctttgat 
s stgtvmgavimegfyvvfd 

cgggcccgaaaacgaattggcttigctgtcagcgcrtgccatgtgcacgatgagttcagg 
rarkrigfavsachvhdefr 

acggcagcggtggaaggcccttttgtcaccttggacatggaagactgtggctacaacatt 
taavegpfvtldmedcgyn: 

cc ac agacagatgagtcatga 

P Q T D E S * 
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FIGURE 7 (1) 

ATGGCTAGCA7GACTGGTGGACAGCAAATGGGTCSCGGATCGATGACTATCTCTGACTCT 
MASMTGGQQMGKGSMTI 5 PS 

C C G CG TG AA C AGCAC GG ATC C AC C C AG C A C G GC AT C CGG C TGC C C C TG C G C AG C G G C C TG 
? P £ Q i: G S 1' C. H G I R L P Ij R S G Ij 

GGGG3CGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCC 
GGAPLGLRLPFETDEEPEEP 

GGCCGGAGGGGCAGCTTTGTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGC 
G F RGSFVEMVDNLRGKS J Q G 

TACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCTCAACATCCTGJTGGATACA 
YYVEMTVGSPPQTLNILVDT 

GG CAGC AGTAACTTTGC AGTGGGTGCT3 C CC C CC AC CCCTTC CTGC ATCGC T AC TAG C AG 
G F. SNFAVGAAPHPFLHRYVQ 

AGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGCGTGTATGTGCCCTACACCCAG 
R g L S £ T Y F. DLRKGVYVPYTQ 

GGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTC 
GKWEGELGTDLVSIPHGPNV 

ACTGTGCGTGCCAACATTGCTCCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCC 
TVRANIAAITESDKFFI 11 GS 

AACTGGGAAGGCATCCTGGGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTG 
NVJ EG I LGLAYAE I ARPDD 3 L 

GAGCCTTTCTTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAG 
EPFFDSLVKQT>iVPKLFSLQ 

CT'I^-GTGGTGCTGGCTTCCCCCTCAACCAGTCTCAAGTGCTGGCCTCTGTCCSGAGGGAGC 
LCGAUFPLM2SEVLASVGG3 

ACGATCArTGGAGGTATCGACCACTCGCTGTACACA'GGCAGTCTCTGGTATACACCCATC 
MI IGGIDHSLYT3SLWYTPI 

CGGCGGGAGTGGTATTATGAGGTCATCATTGTGCGG37GGAGATCAATGGACAGGATCTG 
RREWYYEVIIVRVEINGQDL 

AAAATfiOACTGCAAGGAGTACAACTATGACAAGAGCATTGTGGACAGTGGCACCACCAAC 
KKDCKEYNYDKS IVDSGTTN 

CTTCGTTTGCCCAAGAAAGTG'rTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCC 
L. K LPKKVFHAAVKS I K A A S S 

ACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGG^AAGCAGGC 
TEKFPDSFWLGEQZjVCWQAG 

ACCACCCCTTGGAACATTTTCCCAGTCATCTCACTCTAGCTAATGGGTGAGGTTACCAAC 
TTPWNIFPVIS^YLMC-EVTN 
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FIGURE 7 (2) 



CAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCA3TGGAAGAT3TGGCC 
Q. C FR ITILPQQYLRFVEDVA 

ACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGrTATG 
TSQDDCYXFAIS2SSIGTVM 

GGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGC 
GAVIMEGFYVVFDRARKKIG 

7TTG CI "G T C AGC G C TTGC C ATGTG C A CG ATG AGT TC AGG A C GG C A3 C G 3 TGG AAG G C C C T 
F A V £ A C H V H C E F R ? A A V E G P 

TTTGTCACCTTGGACATGGAAGACTGTGC;CTACAACATTCCACAGACAGATGAS7CATGA 
FVTLDMEDCGYNIPQTDES * 
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FIGURE 8 (1) 



ATGACrCAGCATGGTATTCGTCTGCCACTGCGTAGCGGTCTGGGTGGTGCTCCACTGGGT 
MT-QHGTRLPLRSGLGGAPLG 

:'rGCG'l'CiX3CCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGC r r:"I' 
LRLPRSTDEEPEF.PGRRGSF 

3 r ; i GGAGATGGTGGACAAGCTGAGGC>GC.A/vG7CGGGGCAGGGCTACTACG r rGGAGA r L , GACC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCA3ACGCTCAACATCCTGGTGGATACAGGCA3CAGTAACTTTGCA 

V G S ? P Q T L M I L V D T G 5 5 N F A 

G'?GGGTGCTGCCCCCCACCCC? , rCC^G::ATCGCTACTACCAGAGGCAGCTG f rccACJCACA 

V G A A ? H P F L H R Y Y Q R Q Ij S S T 

TACCGGGAGCTCCGGAAGGGCGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 

V K L) ij K K G V V V P Y T <j G K W E G E 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
G T D L V S I P H G P N V T V R A N I 

GCTGCCATCACTGAATCA3ACAAGTTCTTCATCAACGGCTCCAACTG3GAAGGCATCCTC 
AAITESDKFFINGSNWEGIL 

GGC^TGGCCTAT-SCTGAGATTGCCAGGCCTGACGACTCCCTGGAC-CCTTTCTTTGACTCT 
G A Y A E IARPDDS!jEPFFDS 

CTGGTAAAGCAGACCCACGTTCCCAACCI^TTCTCCCTJCACCTTTGTGGTGCTGGCTTC 
v V K Q ? H V P N L F S L Q L C G A G F 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGG3AGCATGATCA1TGGAGGTATC 
PLNQSEVLASVGGSMI I G G I 

GACCACTCGCTGTACACAGGCAGTCTCT3GTATACACCCATCCGGCGGGAGTGGTATTAT 
i; H 3 L Y T G S I, W Y T P I R P E W Y Y 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACrGCAAGGAG 
E V I IVRVE INGQELKKDCKE 

TACAACTATGACAAGAGCATTGTGI3ACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 
YNYDKS IV3SGTTNLKLPKK 

GTGTTTGAAGCrGCAGTCAAATCCATCAAGGCAGCCTCCrCCACGGAGAAGTTCCCTGAT 

V F E A A V K S IKAASS7EKFPD 

GG'I'TTCTGGCTAGGAGAGCAGCTGG'rG'^GGTGGCAAGCAGGCACCACCCCT'l'GGAACATT 
G F W L G E Q L V C W Q A 3 T T P W N I 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTTCGCATCAGC 

fpvislylmgevt hqs frit 
atccttccgcagcaatacctgcgg:cagtogaagatgtggccacgtcccaagacgactgt 

ILPQQYLRPVEDVATSQDDC 
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FIGURE 8 (2) 



TACAACriTTGCCATCTCACAGTCATCCACGGGCACTGrTATGGGAGCTGTTATCATGGAG 
YKFAIS QSSrGTVHCAVIME 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAAAACGAATTGGCTTTGCrGTCAGCGCrTGC 
GFYVVF DRARKK I GFAV SAC 

CATTAG 
H 
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FIGURE 9 



IP: Ab 369 



V-CTF99 



4-CTF99 
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FIGURE 10 
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FIGURE 11 



MAOALPWLLLWMGAGVLPAHG TQHGIRLPLRSGLGGAPLGLRLPRETDEE 
PEEPGRRGSFVEM^/DNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDD3LEPFFDS 
LVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 
YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 
VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPVJNTFPVTSLYLJ4G 
EVTN2SFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

DES 
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FIGURE 12 



MAOALPWLLLWMGAGVLPAHG TQHGIRLPLRSGLGGAPLGLRLPRSTDEE 
PEEPGRRGSFVEMVDNLRGKSGQGYYVEMTVGSPPQTLNILVDTGSSNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTHVPNLFSLQLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 
YTPIRREWYyEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 
VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 
EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVSGPFVTLDMEDCGYNIPQT 
DHSHHKHHH 
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SEQUENCE LISTING 

1 1 C > Curnpy, Mar.< E. 

EienkuwaKi , Mi-hdel J. 
Hoinr ikson , Robert L. 
Parodi , Luis A . 
Yan, siqiang 

Pharmacia & Upjohn Company 
<120> Alzheimer's Disease Secretase 
<130> 6177.P CP 

<I41> 

•:150> 6C/1C1,594 
•■.151> 1998-09-24 

vl60> 45 

<170> Patentln Ver. 2.0 

<-210> ; 

v211> 1804 

<212> DNA 

<213> Homo sapiens 

•400> 1 

atygycgcac tggcccqggc gctgctgctg cctctgctgg cccagtggc:t: cctgcgcgcc £0 
gccccggagc tggcccccgc gcccttcarg ctgcccctcc gggtggccgc ggccacgaac 120 
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cgcgtagttg cgcczacccc gggacccggg 
gcgctcgccc tqgagcctqc cctggcgtcc 
gtagacaacc tgcaggggga ctctggccgc 
cccccg^aga agctacagat tctcgttgac 
accccg~act ccta^atags cacgtactt t 
aagggcttty acgLcacagt gaagtacaca 
aacctcqtca ccatccccaa aqqcttcaat 
-ttgaatcag agaatttctt t-tgcctggg 
Utgccacac t'zgccaagcc aticaagttct: 
caagcdaaca tccccaacgt LtiLctccaty 
gcatctggga ccaacggagg tagt^ztgr.c 
ucagacatct. cgtaLacccc -actaaggaa 
t.r.ggaaat- rg gaggrnaaag rcttaat-.ctg 
atcgtggaca gtggcaccac gctgctycyc 
craagctgtgg cccgcgcatc tccgattcca 
■rage tggcgt. gctggacgaa t:tcggaaaca 
lace tgagag atgagaactc cagcayytuu. 
attcagccca tgargggggc cggcctgaat 
•\ccacaaatg cgctggtgat cggtgccacg 
dgagcccaga agagggtggg cttcgcagcg 
gtcrtctgaaa tttccgggcic t ttctcaaca 
cagtctttga gcgagcccat tttgtggatt 
gccaLcct-c ttgtct-aat cgtcctgctg 
■;gtgaccctg aggtcgtcaa tgatgagtcc 
gccaggccLg acctcaagca accatgaact 
agcagccggg atcgatrjgtg gcgcLLLcLc 
gctcccagat gccttcLaga ttcactgtct 
ctccctaccr ccaagaaaaa caattaaaaa 
aaaa 



acccctgccg agcgccacgc cgacggcttg 150 
cccgcgggcy ccgccaactt cctggccatg 240 
ggctactacc tggagatgct gatcggga^c jCC 
actggaagca gtaactttgc cgtgycagga 360 
gacacagaga ggtctagcar ataccgctrc 4 2 C 
caaggaagct ggacgggctt zgttggggaa 48C 
actlctt etc ttgtcaacat !:gccactat:t 5 4 C 
attaaatgga atggaatact rggectaget 60C 
ct:ggagacct t:ct:tcgactc ccLgctgaca C60 
cagatgtgtg gagccgcctt. gccccttgct 720 
ttgggtggaa rtgaaccaag tttgtataaa 73 0 
gag -cg^acc accagataga aattctgaaa 840 
ga~t-g<-agag agt- ataacgr: agacaaggc:: 9 00 
ctgccccaga aggtytttga tgcggtggtg 960 
gaatrctctg atggtttctg gactgggtcc 1U20 
cct tggCctt actuccctaa aatctccatc 1080 
ttcjgtatca caa'cctgcc tcagctttac 1140 
tatgaatget accgattegg catttcccca 1200 
gtgatggagg gettcraegt catcttcgao 1260 
agc~cctgtg cagaaattgc aggtgetgea 1320 
yaggatgtag ccagcaactg tg'ccccgct 1380 
gtgtcctatg cgctcatgag cgictgtgga 1440 
ctgcrtgccgt tccggtgtca gcgtcgcccc 1500 
tctctggtca gaeategctg gaaatgaata 1560 
cagctattaa gaaaatcaca tttccacggc 162C 
ctytgcccac ccgtcttcaa tctctgttc^ 1660 
tttgattctt gatt-tcaay ctttcaaatc 1740 
aaaaacttca ttctaaacca aaaaaaaaaa 1800 

1804 



<210> 2 
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<2l\> 516 
<212> PRT 

*'.1H:> Horr.o sapiens 
<4 0 0> 2 

Met Gly Ala Leu Ala Arg Ala Leu Leu Leu Pro Leu Leu A La Gin Trp 
1 5 10 15 

Leu Leu Aiy Aid Ala Pro Gin Leu Ala Pro Ala Pre Phe Thr Leu Pro 

22 2S 30 

Leu Arg Val Ala A^a Ala Thr Asn Arc Val Val Ala pro Thr pre Gly 
35 4C 45 

Pro 3iy Thr Pro Ala CJlu Arg His Ala Asp Gly Leu Ala Leu Ala Leu 

50 55 50 

Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu Ala Met 
65 70 75 30 

Val Asp Asn Leu Gin Gly Asp Scr Gly Arg Gly Tyr Tyr Leu Glu Ket 
85 90 95 

Leu I la Gly Thr Pro Pro Gin Lys Leu Gin lie Leu Val Asp Thr Gly 

1G0 105 110 

Ser Ser Asn Fhe Ala Val Ala Gly Thr Pro His Ser Tyr lie Asp Thr 
115 120 12b 

Tyr Pho Asp Thr Glu Arg Scr Ser Thr Tyr Ary Ser Lys Gly Phe Asp 
130 ~ 135 140 
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Val Thr Val Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val Gly Glu 
145 150 1S5 163 

Asp Leu Val Thr lie Pro Lys Gly Phe A. sr. Thr Ser Phe Leu Val Asr. 

165 17G 17S 

He Ala Thr He Phe Glu Ser Glu Asn Phe Phe Leu Pro Gly lie Lys 
180 135 19 0 

Trp Asn Gly He Leu Gly Leu Ala Tyr Ala Thr Leu Ala Lys Pre Ser 

195 2C0 2 OS 

Jcr Scr Lou Glu Thr The Phe Asp Ser Leu Val Thr Gin Ala Asn He 
210 215 220 

Pro Asn Val Phe Ser Ket Gin Met Cys Gly Ala Gly Leu Pro Val Ala 

;25 230 235 24C 

Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly He Glu Pro 
245 250 255 

Ser Leu 'Tyr Lys Gly Asp He Trp Tyr Thr Pro He Lys Glu Glu Trp 
260 265 270 

ryr r :yr Gin He G^u Tie Leu Lys Leu Glu lie Gly Gly Gin Ser Leu 
275 2S0 285 

Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala Hp Val Asp Ser 

290 295 300 
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Gly Thr Thr Leu 
305 

Glu Ala Val Ala 

Trp Thr Gly yer 

34 0 

Ser Tyr Phe Pre 
355 

Arg Sfir Phe Arg 
37C 

Met Gly Ala Gly 

3 3 5 

iier Thr Asn Ala 

Val He Phe Asp 
420 

Jys Ala Glu He 
435 

Ser Thr Glu Asp 
45 C 

Glu Pre He Leu 



Leu Arg Leu Pre 
31C 

Arg Ala Ser Leu 

325 

Gin Leu Ala Cys 

Lys He Ser He 
3 6C 

Ha Thr Tie Leu 
375 

Leu Asn Tyr Glu 

Leu Val lis Gly 
405 

Arg Ala Gin Lys 

Ala Gly Ala Aid 
440 

Val Ala Scr Asn 
455 

Trp He Val Ser 



Gin Lys Val Phe 

3 15 

He Pro Glu Phe 

33C 

Trp Thr Asn Ser 
345 

Tyr Leu Arg Asp 

Pro Gin Leu Tyr 
380 

Cys Tyr Arg Phe 
3 95 

Ala Thr Val Met 
410 

Arc Val Gly Phe 
425 

Val Ser Glu He 

cys Val Pre Ala 
460 

Tyr Ala Leu Met 



Asp Aid Val Val 
320 

Ser Asp Gly Phe 
335 

Glu Thr Pro Trp 
35C 

GLu Asn Ser Ser 
3 65 

lie Gin Pro Met 

Gly He Ser Pro 
400 

Glu Gly Phe Tyr 
415 

Ala Ala Ser Pro 

430 

Ser Gly Pro Phe 
445 

Gin Ser Leu acr 
Ser Val Cys Gly 
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4 :5 470 475 480 

Ala He Leu Leu Val Leu He Val Leu Leu Leu Leu Pro Phe Arg Cys 
485 490 495 

GLn Arg Arg Pro Arg Asp Pre Glu Val Val Asn Asp Glu. Ser Ser Leu 
500 505 5ic 

Yal Arg His Arg Trp l.ys 
515 



-;no> i 

<211> 2070 

<212> DNA 

<213^ Hcrr.c sapiens 

-40J> 3 

atggcccaag ccctgccctg gctcctgctg 
ggcacccagc a~ggcatccg gctgccccLg 
-tg^ggctgc cccgggagac cgangaagag 
gtggagatgg tggacaacct gaggggcaag 
otgygcagee ccccgcagac gctcaacatc 
gtgggtgctg ccccccaccc ctlcctgcat 
La^cgggacc teeggaaggg tgT.gtaLy 
rtgggcacr.g ac=tggtaag ca:.ccc:cat 
gctgccatca ctgaatcaga caagttcttc 
gggctggcct atgetgagat tgccaggcct 
ctggtaaagc agacccacgt t:ccaacctc 
<.cecLcaa.ee agtctgaagl yetgycclcL 
gaccactcgc tgtanacagg cagtctctgg 



cggatgggcg cgggagtgc- gcctgcccac 6 0 
egcagegyce tygggyycyc ccccclyyyy 120 
nccgaggagr oeggceggag gggcagrtr.t ISO 
tieggggcagg getactaegt ggagatgacc 240 
ctggtggata cagccaacag ta^ctttgea 3 00 
cgctactacc agaggcagct gtccagcaca 360 
ccctacaccc aggycaayty yyaayyyyag 420 
ggccccaacg tcactgtgcg tgecaacatt 480 
arcaaegget ccaactggga aggcatcctg 54 0 
gacgactccc iggacccztz ctttyactci 6C0 
ttctccctgc acctttgtgg tgctggcttc 660 
ylcyyaggga gcalyatcat tyyaggtatc 72 0 
ta'.acaccca tecygeggga gtggtattat 760 
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caggtcatca ttigtgcgggt ggagat caal 
tacaaccacy acaayaycao tytygacagt 
ctgtttgaag ctgcagtcaa atccatcaag 
eytttc-yyc tayyayayca yctyytgtgc 
t.tccnagtca tctcacf.cta cctaatgggt 
atccttccgr: agcaacacct gcggcoagtg 
lacaagLlity ccatctcaca qtcazccacq 
ggcttctacg ttgtctttga tcgggcccga 
i:acgtgcacg atgagttcag gacggcagcg 
ijaagacUjLcj yctacaacat tccacagaca 
gtcatggctg ccatrtgcgc cctcttcatg 
ccctgcctcc gctgcctgcg ccagcagcac 
,-ingt-gaggag gcccatgggc agaagatag-. 
■;«ttggtcac aagtaggaga cacagatggc 
ocacccacca aatgccrtctg ccttgatgga 
jgactgtacc Lgtaggaaac agaaaayaga 
.:ttygLcacc Lcaaatttaa yLcgyyaaat 
ttgcccacca Ltcctttaaa ttctccaacc 
gtactggcat cacacgcagg ttaccttggc 
agaccaagct tgtttcnctg ctggccaaag 
rgctttagag acagggactg tataaacaag 
attaaaaaaa aaaaaaaaaa aaaaaaaaaa 



ggacagga^c rgaaaatgga ctgcaaggag 840 
agcaccscca accttcgttc gcccaagaaa 9C0 
g^agcctcct ccacggagaa gtticcctga- 9 60 
tgycaagcag gcaccaccc- ttgcaacatt 1C2C 
gaggttacra ar:cagtcctt ccgcatcanc 108 0 
gaagatgtgg c:acgt^cc3 ogacgactgt 1140 
gcjcactgtta tgaqagctgt catcatggay 12C0 
aaacgaattg g-tttgctgt cagcgcttgc 1260 
gtggaaggcc :::t:ctcac cttggacatg 1j2C 
gatgaytcaa ccctcatgac catagcctat 13 SO 
ctgcca~tct gcctcatggt gtgtcagtgg 1 4 4 f> 
gacyact.ttg ctgatgacat ctccctgctg 1500 
gat: t cere gaccacacrt- cegfggt tea IS fit; 
acctgtggcc agagcacetc aggaccctcc 162G 
gaaggaaaag gctggcaagg tgggttccag 1680 
agaaagaagc actctgetgg -gggaatact 1740 
tutgciyctt yaaacttcay ecctgaacct 1800 
caaagtattc ttcttttctt agt^tcagaa 1860 
gtg^gtcect gtggtaccct ggcagagaag 1920 
tcagtaggag aggatgeaca gtttgetatt 1980 
cctaacatty grycaaagat" tgcctcttga 2040 

2070 



<210> 4 
v211> 531 
<212> PRT 

<213> lloir.o sapiens 
<400> 4 

Met Ala Glr: Ala Leu Pro Trp Leu Leu Leu Tip Ket Gly Ala Gly Val 
IS 10 IS 
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Leu Pro A. a His Gly Thr Gin His Gly Ila Arg Lea Pro Leu Arg Ser 
20 2S 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Fro K.g Glu Thr Asp 
35 40 45 



Glu Glu Pro Glu Giu Pro Gly Arg 
30 55 

Asp Asn Leu Arg Gly Lys Ser Gly 
65 70 

Val Gly Ser Pro Fro Gin Thr Leu 
85 

oe: Asn Phs Ala Val Gly Ala Ala 
100 



Arg Gly Ser Phe Val Glu Ms: Val 
60 

Clx Gly Tyr Tyr Val Giu Met Thr 

7 b 8 0 

Asn He Leu Val Asp Thr Gly Ser 
90 95 

Pre His Pro Phe Leu His Arg Tyr 
105 110 



TV- Gli, Any Gl;: Leu S^r Sui Thi Tyi Ary Asp Leu Ax y Lyt Gly Vet- 
11 B 12G 125 



Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Giu Leu Gly Thr Asp 
130 135 14C 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 

145 150 155 150 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 
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Glu Cly lie Leu 
180 

Eer Leu Glu Pro 

A^n Leu Phe Sur 

21.0 

Ser Glu Val Leu 
225 

Asp His Ser Lea 

Glu Trp Tyr Tyr 
260 

Asp Leu Lys Met 

2?: 

Asp Ser Gly Thr 
290 

Ala Val Lys Ser 
305 

Gly phe Trp Leu 
Pro Trp Asn lie 



Gly I gu A. a Tyr 

?he Phe Asp Ser 

20U 

Leu His Leu Cyb" 
215 

Ala Ser Vdl Gly 

230 

Tyr Thr Gly Ser 
245 

Glu Val lie Tie 

Asp Cys Lys G^u 
23 0 

Thr Asn Lou Arg 
295 

lie Lys Ala Ala 
JIG 

Gly Glu Gin Leu 

325 

Phe Pro val He 



Ala Glu tie Ala 
135 

Leu Val Lys Gin 

Gly Aid Gly Phe 

22C 

Gly ser Met He 

235 

Leu Trp Tyr Thr 
250 

Val A^g Val Glu 
265 

Tyr Asn Tyr Asp 

Leu Pro Lys Lys 

300 

Ser Ser Thr Glu 
315 

Val Cys Trp Gin 
333 

Ser Leu Tyr Leu 



Arg Pro Asp Asp 
190 

Thr His Val Pre 

205 

Pic Leu Asn Gin 

He Gly Gly He 
240 

Pro He Arg Arg 
255 

He Asn Gly Gin 
270 

Lys Ser He Val 

285 

Val Phe Glu Ala 

Lys Phe Pro Asp 
320 

Ala Gly Thr Thr 
335 

Met Gly Glu Val 
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340 345 350 

rr r,- »\sn Gin ^pr ?hn Arg I'o Tr.r rls F,gq Pro CI r Gin Tyr f.ru Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lyn Phe Ala 
370 375 380 

He Ser Gin Ser Ser Tr.r Gly Thr Val Met Gly Ala Val He Met Glu 
..i8 5 3 90 3 95 4 JO 

Uly Phe Tyr Val Veil Phe Asp Arg Ala Arc Lys Arg He Gly Phe Ala 
4 0 5 'ilC 41b 

Val Ser Ala Cys His Val His Asp Glu Fhw Arg Thr Ala Ala Val Glu 
4 21 425 430 

Gly Pro Phe val Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
43b 443 445 

Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met Ala Ala 

450 455 460 

He Cys Ala T^i Php K*=t I.ftu Pro T.pu Cys T.pu Kftt Val Cys Gin Trr 
465 470 475 480 

Arg Cys Lou Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp 
485 490 495 

He Ser Leu Leu Lys 

bOU 
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-:210> 5 

<2U> 19*77 

-:212> DNA 

<21"'-> Hcruo sapiens 

^40C> 5 

atgqcccaag ccctgccctg gctcc-gctg 
ggcacccagc acggcatccc gctg~ccctg 
'^tg^ggctyc cccgggagac cgacgaagag 
gtggagatgg tggacaacct gaggggsaag 
ytgygcagcc ccccgcaqa: qclcaacatc 
g'-gygLgcLg ccccc:acc: cLtccLgca^ 
f.acergggacc tccggaaggg tgtgtatgtg 
stcggcascg acctggtaag carc:.cc:at 
gctgccatca ctyaatcaqa caagctctte 
gggctggcct atgctgagat tgccaggcf. t 
gaagtgctgg ccLctytcgj agggagcatg 
v-aggcagtc tctggtatac acccatccgg 
jggg -ggaga tcaatggaca ggaictgaaa 
aycuttytyg acagtggcac caccaaccLL 
gtcaaa^rra tcaaggcagr crc.r.r.c.r.ar.cj 
gag-:agctgg tgtgctggca agcaggcacc 
■rtctacctaa tgggtgaggt taccaaccag 
Lac^tgcggr: cagtggaaga "gtggc^acg 
L:ci:agLcat ccacgggcac tgtLatggga 
r r tgatcgyy cccyaaaacy aattggcttt. 
f.tcaggacgg r.accggtgga aggccctttt 
aacattccac agacagatga g tcaaccc tc 
:gcgccctct tcatgctgcc a^tctgcotc 



tggatgggcg cgggagtgct gcctgcccac 6 0 
cycagcggcc ~gygyggcgc "ccccLgggg 12 0 
cccgaggagc ccggccggag gggcagcttt 180 
tcggggcagg gcractacgt ggagatgacc 24 0 
ctggtggata cagycagcay taactttgca 3 00 
cgcLa~Lac:c ayagycaycL g:ccagcaca 3 60 
cccUcaccc agggcaagtg ggaagyggag 42 0 
ggccccaacg tcac tgtgcg tgccaacatt 43C 
atcaacggct ccaactqgqa aggcatcctg 540 
tgtgg-gctg gctlccccct caaccactct 5 3 C 
atrattggag gtatcgacca ctcgctgtac 660 
cgggagtggt attatgaggr. gaccattgtg 720 
atyyactgca aggagcacaa ctatgacaag 780 
cgLLt-yccca ayaaayLylt Lyaayccgca 340 
gagaagi-t.rr rt" ga -ggt 1 1- rtggntagga 900 
accccttgga acut'ttcce agtcatctca 960 
tccttccgca tcaccatcct tccgcagcaa 1020 
tcccaagacg actgttacaa gtttgccatc 1060 
ccLgtLatca tggagggcil cLacg L Lq Lc 1140 
cct-gtnagcg cttgccai"g^ gcringatgag 1200 
gtcaccttgg acatggaaga ctgtggctac 12-50 
atgaccatag cctatgtcat ggctgccatc Uz3 
atggtgtgtc agtggcgctg cctccgctgc 13S0 
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c tgc gc cage 


agcatgatga 


c tt tgc tgat 


gacatctccc 


tgctgaaytg 


aggaggceca 


1440 


tggeeagaag 


atagagattc 


csc tggacca 


cacc teegtg 


gt tcactttg 


g:cacaag:a 


1 h U U 


ggagacacag 


atggcacc tg 


tggecagagc 


acc tcaggac 


cctccccacc 


caccaaatgc 


1560 


LlcLycr: LLC 


atggagaagg 


aaaaggc tgg 


caagy tgggt 


tccagggac t 


g ta.ee tg tag 


1620 


gaaacagaaa 


agagaagaaa 


gaagcac tct 


gc tgg c gaga 


atactcttgg 


tcacctcaaa 


ib«n 


t ttaagt egg 


gaaat tc tgc 


tgc t tgaaac 


t tcagccctq 


dicct t tgte 


caccattect 


1740 


t. taaatt ore 


caacccaaag 


taticttctt 


ttcttagttt 


cagaagtac t 


ggcatcacac 


I'd CO 


ccaggttacc 


ttggcgtgtg 


tc:c tgtggt 


acec tggcag 


agaacagacc 


aagcttgttt 


I860 


c-cctgctggj 


caaagtcagt 


aygagaggat 


gcacagt t tg 


ctatt tgett 


tagagacagg 


1320 


cac r.gt a r,aa 


araagce r aa 


r.nt rggtgca 


aaga*" rgcnr 


rr tgaaaaaa 


aaaaaaa 


;^77 



*21U> r 
<211> 476 
-212> PRT 

-213> He mo sapiens 
<40C> 6 

Met Ala Gin Ala Leu. Pro Trp Leu Leu Leu Trp Me*: Gly Ala Gly Va 1 
1 5 10 15 

Leu Pri Ala His Gly Thr Gin His Gly He Arg Leu Pre Leu Arg Ser 
20 25 3C 

■;ly Leu Gly Gly Ala Pre Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pre Gly Arg Arg Gly Ser Phe Val Glu Ket Vai 
5C 55 60 

Asp Asn Leu Arg 31y L-ys ser Gly Gin Gly Tyr Tyr Val Glu Net Thr 
7C 7 5 8 0 
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Ya.1 Giy Ser Pro 

fpr Asn Phe Ala 
13 0 

Tyr Gin Ary Gl:: 
US 

Tyr Val Pro Tyr 
13 C 

Leu V<al Ser He 
145 

Ala Aid lit Thr 

Glu Giy He Leu 
18 3 

Ala Gly Phe Pro 
19 5 

Ser Met Tie He 
210 

Trp Tyr Thr Pro 



Pre Gin Thr Leu 

85 

Val Gly Ala Ala 

Leu Ser Ser Thr 

120 

Thr Gin Gly Lye 
135 

Pro His Giy Pre 
150 

Glu Ser Asp Lys 
165 

Gly Leu Ala Tyr 

Leu Asn Gin Ser 
200 

Gly Gly He Asp 
215 

lie Arg Arg Glu 
23C 



Asn He Leu Val 

90 

Pre His Pro Phe 
135 

Tyr Arg Asp Leu 

Trp Glu Gly Glu 
140 

Asn Val Thr Val 
155 

Phe Phe He Asn 
170 

Ala Glu He Ala 
1R5 

Glu Val Leu Ala 

His Ser Leu Tyr 
220 

Trp Tyr Tyr Glu 
235 



Asp Thr Glv Ser 
95 

Lftu His Arg Tyr 
110 

Arg Lys Gly Val 
125 

Lou Gly Thr Asp 

Arg Ala Asn He 

160 

Gly Ser Asn Trp 
175 

Arg Leu Cys Gly 
19H 

Ser Val Gly Gly 
205 

Thr Gly Ser Leu 

Val He Ha Val 
240 
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Arg Val Clu lie Asn Cly Clin Asp Lou Lys Mgl Asp Cys Lys Clu Tyr 
24b 250 255 

Asn Tyr Asp Lys Cer He Val Asp £er Gly Thr Thr Asn Leu Arg Leu 

260 265 270 

Plu Lys Lys Veil Phe Glu Aid Aid Vdl Lys Sei Il« Lys Aid Ala Ser 

27S 28 n 285 

iser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 

290 2 9 5 3 0 0 

Cys Trp Sin Ala Gly Thr Thr Pre Trp Asn lie Phe Pro Val lie Ser 
305 210 315 320 

Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
325 33 C 335 

Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin 
340 345 350 

Asp Asp cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 

Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 330 

Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys His Val His Asp Glu 
385 390 395 40C 

Phe Arg Thr Ala Ala val Glu Gly Pro Phe Val 'lhr Leu Asp Ms- Glu 
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He Ala Tyr Val Mot Ala Ala lie Cys 

4 3b 440 

L'ys Leu Met Val Cys Glr. Trp Arg Cys 
450 4bb 

His Asp Asp PI:« Ala Asp Asp He Ser 
465 47G 
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41C 416 



Asp Glu Ssr Thr Leu Met Thr 
430 



Ala Leu Pho Met Leu Pro Leu 
44 5 



Leu Arg Cys Leu Arg Gin Gin 
4bU 



Leu T.eu Lys 
475 



c210> 7 
-.21 1> :>04 3 
■:212> DNA 

<213> Mus musculus 



<400> 7 



atggcc crcag 


cgc tgcartg 


gctec tge::a 


tyggtgggct 


egggaatget 


gcctgcccag 


60 


qgaacc natc 


tcqccatccq 


gctgccc^tt 


cgcagcggcc 


tggcagggee 


accccLggcc 


123 


■r tgaggcLgc 


ccccggagac 


tgacgaggaa 


teggaggage 


ctggccggag 


aggcagnt tt 


18H 


gtggagatgg 


tggacaa.ee t 


gaegggaaag 


tccggc.agg 


gctactatgt 


ggagatgacc 


240 


gtaggcagee 


ccccacagac 


gc tcaacatc 


c tggtggaca 


egggcagtag 


taactttgea 


JO'J 




ccccacaccc 


tt tectgeat 


cgc tactacc 


agaggcagct 


gtccdgcaca 


35C 


Latcqagacc 


tccgaaaggq 


tgtqtacqLq 


ccc tacaccc 


agqgcaagtg 


gg^gggggaa 


420 


- tgggcaccg 


acctggtgag 


catccctcat 


ggccccaacg 


tcactgtgcg 


tgecaacatt 


480 


jc tgecatea 


ctga^tcgga 


caagttctcc 


a tcaatgg 1 1 


ccaac tggga 


gggcau.ee ta 


040 


■ijggctggcct 


atgetgagat 


tgccaggccc 


gacgactct t 


tggagcect t 


ctttgactcc 


6C0 
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ctggtgaagc agacccacat tcccaacatc 
cccetcaacc agaccgaygc actgycctcg 
gaccactcgc tatacacggg cagtctctgg 
gaagr.gatca ttgiacgtgt ggaaatcaat 
Lacaacnacg a~aaqagca*_ tg'ggacaqu 
gtatt tgaag ctgccgtcaa gtccatcaag 
agcntttggc taggggagca gczggtgtgc 
ttc-cagtca tttzacttta cdcatgggt 
otcc-tcct-: agcaatacct acqgccggtg 
r acaagttcg ctg-ctcacn gtcatccacg 
ngtt tctatg tcgtcltcga tcgagcccga 
-atgtgcacg atgagttcag gacggcggca 
uaaaactgtg gctacaacat ti^cccagaca 
gtcatggcgg ccarctgcgc cr:tctt.catg 
■.gctgcctgc gttycctc-y c^accagcac 
aagtaaggag gctcgtgggc agatgatgga 
■rtttggtcac atgagtrgja gctatggatg 
■:accaacctg ccaat:gcttc tggcgtgaca 
gggcttgcac ct.gtaggaca caggagacgg 
■rcttaggcac cacaaacttg agt.ggaaat 
■_ tgc-cagca tcctztagag tctccaacc: 
tggcgtcata ctcaggctac ccggcatgtg 
aatcccatLc cctgctggcc aaagtcagca 
luatagggac tgcagactca agcctacact 
gaa 

*210> a 

-211> 501 

*212> PRT 

•213 > Mus musculus 



trttccctgr agctc*_gtgg cgctggcttc 660 
gtgggaggga gcatgatcat tggUjgtatc: 0 
tacacaccca tccggcggya y Lggtatuat. 780 
ggtcaagatc tc.iagatgga ctgcaaggag S40 
gygaccacca accttcgctt gcccaagaaa you 
gcagcctcct cgacggagaa gttcccggat S6C 
tg^/caagcag gcacgacccc ttggaacatt 1C20 
gaagccacca atcagtcct" ccgcat:acc :080 
gaggacgtgg ccacgtccca agacgactgt 1140 
ggcactgtra tgggagccgt cat^atggaa 1200 
aagcgaattg gctttgctgt caccgcttigc 1260 
gzggaaggtc cgtttgttac ggcagacazg 132C 
gatgagtcaa cact.tatgac catagcctat 1J80 
ttgccactc:. gcctcatggt atgtcagtgg 144C 
ga f „gactttg c'gatgacat ctccctgctc 1500 
gacgcccctj gaccacaict gggtggttcc 1560 
gtacctctgg ccagcgcac: tcaggaccct 1620 
gaacagagaa atcaggcaag ctggattaca 16 30 
aaggaagcag cgr.tctggtg gcaggaatat 1740 
tt^gctcctt gaagcttcag ccctgacc:: 1300 
aaagtattct n-atgtcct t ccacaagtac ItibO 
t=rctgtggt accctggcag agaaaggg^c 1920 
gaagaaggtg aaylCtgcca gt tcctt tag 1930 
ggtiacaaaga ctgcgt ~ttg agataaacaa 2040 

2043 
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<4 0 0> 0 

Met Ala Pro Ala ^eu His Trp Leu 

1 5 

Leu ?rn Ala Gin Gly Thr His Leu 
20 

Gly Leu Ala Gly Pro Pro Leu Gly 
3 5 40 

Glu Glu Scr Clu Glu Pre Gly Arg 
5C 55 

Asp Asn Leu Arg Gly Lys Ser Gly 

65 70 

val Gly Sftr Pro Pro Gin Thr Leu 
85 



Leu Leu Trp Val Gly Ser Gly Met 
10 1^ 

Gly Lie Arg Leu Pre Leu Arg Ser 
25 3 0 

Leu Arg Leu Pre Arg Glu Thr Asp 
4S 

Arg Gly Ser Phe Val Glu Mot Val 

60 

Gin Gly Tyr Tyr Val Glu Met Thr 

75 80 

Asn lie Leu Val Asp Thr Gly Ser 



Ser As:: Phe Ala Val Gly Ala Ala 
100 

Tyr Gin Arg Gin Leu Ser £er Thr 
115 120 

Tyr Val Pro Tyr Thr Gin Gly Lys 
130 135 

Leu Val Ser lie Fro His Gly Pro 
145 15 0 



Pro His Pio Phe Leu Kits Ax g Tyr 
10n 110 

Tyr Ar:r Asp Leu Arg Lys Gly Val 
125 

Trp Glu Gly G^u Leu Gly Thr Asp 
140 

Asr. Val Thr Val Arg Ala Asn He 
155 160 
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Ala Ala He Thr 

GLu Gly lie Leu 
18C 

:"er L^u Glu. Pro 
195 

Asn He Phe Ser 
210 

Thr Gla Ala Leu 

Asp His Ser Leu 

Jlu 'Irp Tyr Tyr 
260 

Asp Leu Lys Met 

275 

Asp Ser Gly Thr 
Ala Val Lys Ser 
Gly Phe Trp Leu 



Glu Cer Asp lys 

165 

Gly Leu Ala Tyr 

Phe ?h£ Asp Ser 

203 

Leu Gin Leu Cys 
215 

Ala ^er Val Gly 
230 

Tyr Thr Gly Ser 
245 

Glu Val He He 

Asp Gys Lys Glu 

28G 

Thr Asn Leu Arg 

29b 

He Lys Ala Ala 
313 

Gly Glu Gin Leu 



Phe Phe lie Asn 

17C 

Ala Glu He Ala. 

Leu Val Lys Gin 

Gly Ala Gly Phe 
220 

Gly Ser Ko: He 
235 

Leu Trp Tyr Thr 

250 

Val Arg Val Glu 
265 

Tyr Asn Tyr Asp 

Leu Pro Lys Lys 

300 

Ser ser Thr Glu 
Val Cys Trp Gin 



Gly Eer Asn Trp 
17 5 

Arg Vrc Asp Asp 

190 

Thr His He Pre- 

Pro Lflii Asn Gin 

11 g Gly Gly He 
240 

Pro He Ar rf Arg 
2jG 

He Asn Gly Gin 

27C 

Lys Ser He Val 
285 

Veil Phe Glu Ala 

Lys Phe Pro Asp 
320 

Ala Gly Thr Tnr 
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325 330 335 

Pro Trp Asn He Pho Pre Val lie Ser L^u Tyr Leu Mot Gly Glu Val 
340 34b 3 50 

Thr Asn Gl~ Ser Phe Arg lie Thr He Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

val ser Gin Ser Ser Thr Gly Thr val Me- Gly Ala Val He Met Glu 

3 85 .iqr. 3 9 5 ^00 

u:y Fhe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 41 C 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Tar Ala Ala Val. Glu 

420 425 430 

Oly Pro Php Vr.l Thr Ala Asp M«t Glu Asp Cys Gly Tyr Asn TIp 1 Prn 
435 440 445 

Gin Thr Asp Glu Ser Thr Leu Mat Thr He Ala Tyr Val Met Ala Ala 
450 455 460 

He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp 

4 65 47U 47 5 480 

Arg Cys Leu Arc Cys Leu Aty His Gl:; His Asp Asp Phe Ala Asp Asp 
48$ 49C 495 



19 



WO 00/17369 



PCT/US99/20881 



lie Ser Leu Leu Lys 

5 0 0 

-210> 9 
<211> 2068 
-:>12> DNA 

•'213 > Home sapiens 
-•100> 9 

atycuycccy ytttggcacL gctcctigclg yccgcclyya cggctcgggc gctggaggta 60 
i-ccar:r.rj.ir.c gtaargc t-.gg ncr gorggrr gaar.rcraga r rgccat-gtr ctgtggcaga 120 
■: tgaacatgc acatyaatyt ccayaatygg aagtgggatt cagatccatc aggcaccaaa ISO 
a-ctgeattg atiaccaagca aggcatcctg cagtattgee aagaagtcta ccctgaactg 24 0 
■:agatca:ca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgeaagegg 3 0C 
■\gccgcaagc aytycaag^c ccat-jcccac tttgtgattc ccuccycty cttagttggt 36C 
cagtttgLaa ytgatgeect tctcgttcct gaeaagegea aattcttaca ccaggagagg 42C 
i-ggatgttt gegaaactea tcrtcactgg cacaccgtcg ccaaagagac argcagtgag 48C 
aayagLa:ca acttgeatga ctaeggcatg ttgctgccct gcggaattga caagttccga 540 
ggggtvagagt- ttrgtgtgtty cccactggrt gaagaaagtg acaatgtyya tte'ectgat 600 
geggaggagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcacatggg 660 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggL ggaacaagaa 72 0 
gaagc-gatg atgacgagga cgatgaggat ggtgatgagg tagaggaaga ggctgaggaa 7 80 
-cctacgaag aagecacaga gagaaccac^ agraitgcca :caccaccac caccaccaca 840 
gagtctgtgg aagaggtggt tcgagttcct: acaacagcag ccatjtacccc tgatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatigaacatg rzcatttcca gaaagccaaa 96 0 
gagaggcttg aggecaagea ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 
ycagaacrgtc aagcaaagaa ettgectaaa gctgataaga aggcagttat :cagcat:ttc 1080 
caygayaaag tggaatctt L yyaacaggaa ycayccaacy agagacagca yctyylygag 114 0 
acacacatgg ccagngtgga agccargctc aatgaccgcc gccgcctggc -ctggagaac 1200 
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Ucatcaccg 


ctctgcaggc 


tgt tcctcct 


cggcctcgtc 


acgtgttcaa 


tatgetaaag 


12 6 0 


aagtatg ice 


gcgcagaaca 


gaaqgacaga 


cagcacaccc 


taaagcatL t 


cgagcatgtg 


1320 


cgcatggtgg 


atcccaagaa 


agccgctcag 


at ccggtccc 


aggt tatgac 


acacctccgt 


13 RT. 


g tgat^Uatg 


agcgca »ga.a 


tca:jl'tct.c 


tccctgc tct 


3 :aacc "gc z 


r.gcagtgy = c 


144C 


uaggagat tc 


aggatgaay t 


tgatgagctg 


cttcagaaag 


agcaaaacta 


ttcagatgac 


1500 


gt cttggcca 


acatga: tag 


tgaac 3aagg 


atcag* tacg 


gaaacgatgc 


t ctcatgcca 


1 5 b 0 


lc - ttgaccg 


aaacrjaaaac 


cac::gtggag 


cr.ccttcccg 


tgaatggaga 


gttcagcctg 


1620 


■ jacgatc: jc 


agccgtggca 


ttct tttggy 


gctgac tc tg 


tgccagccaa 


cacagaaaac 


16£C 


jaagttgagc 


ctgttgatgc 


ccgcc -tget 


gccgaccgag 


gactgaccac 


tcgaccaggt 




■_c tgggtt^a 


caaatatcaa 


gaeggaggag 


at~tc tgaag 


tgaagatgga 


tgeagaatte 


1 8 1 j 0 


ogacatga zt 


caggatatga 


agtt z:atcat 


caaaaattgg 


tgttctttgc 


agaagatgtg 


18 6 0 


•;c ttcaaa :a 


aaggtgcaat 


cat: tag acLi.; 


atgg-gggcg 


gtgt tgtcat 


agcgacagt.g 


192C 


•:tt cgtcatca 


cct.ggtgat 


gctgaagaag 


aaacagtaca 


catccattca 


tcatcgtgtg 


19SC 


guggaggttg 


acgccgctgt 


caccccagag 


gagrgccacc 


tgtccaagat 


gcagcagaac 


2 04 0 


gyctacgaaa 


atccaaccta 


caagttcttt 


gagcagatgc 


agaac tag 




2085 



<210> 10 
v211> 69 5 
-:212> PRT 

<2 13> Homo sapiens 

<4CC> 1C 

Met Leu Pro Gly lieu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Giu Val Prr, Thr Asp Gly Asn Ala Giy Leu Leu Ala Glu Pre 

20 2G 30 

■:V_n lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Glr. 
35 40 45 
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Asn Gly Lys Trp 

50 

Thr Lys Glu Gly 

6b 

Gin lie Thr As:: 

Trp cys Lys Ary 

100 

1 1 Prn Tyr Arc: 
115 

Val Pro Asp Lys 

13 C 

Clu Thr Hie Leu 
145 

Lys Ser Thr Asn 

Asp Lys Phe Arg 

13 0 

:*;er Asp Asr. Val 
195 

Trp Trp Gly Gly 



Asp Scr Asp Pre 
55 

lie Leu Gin Tyr 

7 0 

Val Val Glu Ala 
S5 

Gly Arg Lys Gin 

Cys Lpu Val Gly 
120 

Cys Lys ?ho Leu 
135 

His Trp Hie Thr 

150 

Leu His Asp Tyr 

165 

Gly Val Glu Phe 

Asp Ser Aia Asp 

2 00 

A 1 a Asp Thr Asp 



£or Gly Thr Lys 

6:: 

Cys Gin Glu Val 
75 

Asn Gin Pro Val 

90 

Cys Lys Thr His 
105 

Glu Phe Val Ser 

His Gin Glu Arg 
140 

Val Ala Lys Glu 
155 

Gly Net Leu Leu 

170 

Val Cys Cys Pre 
185 

Ala Glu Glu Asp 
Tyr Ala Asp Gly 



Thr Cys lie Asp 

Tyr Pro G^u Leu 

3 0 

Thr lie Gin Asn 

95 

Pro His Phe Val 
1] 0 

Asp Ala i.eu I -Pi'. ; 
125 

Met Asp Val cys 

Thr Cys SGr Glu 
160 

Pro Cys Gly lie 
175 

Leu Ala Glu Glu 
190 

Asp Ser Asp Val 
205 

Ser Glu Asp Lys 
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21C 



220 



Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu clu 
225 22C 235 240 



Glu Ala Asp Asp Asp Glu Asp Asp Clu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu ser val Glu Glu Val Val Arc 
2 7 5 2 a 0 2 3 S 



Val Pro inr Thr Ala Ala Ser Thr Prs Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asr. Glu His Ala His Fhc Gin Lys Ala Lys 
-05 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Mot Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Al* Lys Asr. L«u Pro Lys Ala Asp 

340 345 3 5 C 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 36b 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His KcL Ala 

370 " 380 
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Arg Val Glu Ala. Met Leu Asn A?p Arg Ary Arg Leu Ala Lea Glu Asa 
j 95 390 39b 40 0 

Tyr II© Thr Ala l,eu Gin Ala Val Prr> Pro Anj Pm Arg His Val Phe? 

4C5 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
423 425 430 

".hr Leu Lys His Phe Glu His Val Arg Met Val Asp Pre Lys Lys Ala 
4 3b 4 4 U 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 45b 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pre Ala Val Ala 
465 47C 475 4S0 

•Jiu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

7Yr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 
SCO 5C5 510 

Tyr Gly Asn Asp Ala Leu Ket Pro Ser Leu Thr Glu Thr Lys Thr Thr 

515 520 525 

Val Glu Leu Leu Pr^ Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 
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Pro Trp His lier Phc Gly Ala Asp Sor Va 1 Pro Aia Asn Thr Clu Ad:i 

:-45 550 555 560 

Glu Val Glu Pro Val Asp Aia Arg Pro Ala Ala Asp Arg Sly Leu Thr 

5 55 57 0 57 5 



Thr Ary Pre Gly 

580 

Glu Val Lys Met 
535 

«is His G* n r ys 
610 

Gly Ala 11*5 lie 
■".2 5 

lie Val lie Thr 

His Hi a Gly Val 
660 



Ser Gly Leu Thr 

Asp Ala Glu P.ie 
60C 

Leu Val Pho Phe 
615 

Gly Leu Ket Val 

63 C 

Leu Val Met Leu 
645 

Val Glu Val Asp 



Asn lie Lys Thr 
585 

Arg His Asp Ser 

Ala Glu Asp Val 

620 

Gly Gly Val Val 
6j5 

Lys Lys Lys Gin 

65^ 

Ala Ma Val Thr 

665 



Glu Glu Ily Sei 

590 

Gly Tyr Glu Va^ 

Gly Ser Asn Lys 

I e Ala Th r Va 1 
640 

Tyr Thr Ser lie 
0:5 

Pre Glu Glu Arg 

67 0 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 630 635 

Phc Phe Glu Gin Met Gin Asn 
690 695 
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< 2 1 'J > 11 
<211> 2 088 
- :■'> DNA 
2 1 3 > Huiu'j sapiens 

a-.a:tgcccq qt t -gqcact qcicctqctq qccqcctgga cggctcgggc gctggaggta t'J 
-j.jcactgatg gtaatgctgg cc-gctggct gaaccc-aga ttgccatgtt ctgtggcaga 120 
-cgaacatgc acatgaatgL czagaatygg aagtgggatt cagatccsitc agcgaccaaa ISO 
acctgcat.Lg ataccaagga aqqcatcotg cagtattgcc aagaagtcta ccctgaa-tg 240 
cagatcacca atgtggtaga agccaaccaa ccagtcacca ::cagaact.g gLccaag = gg 3 00 
ggccgcaagc ag^gcaacac cca'.ccccac t-tgtcattc cctaccgctg cttaq^Lggt 3 60 
gagi-~t_gtaa gtga-grcrt frr rgf.r. ~cf. gacaagtgra aal-.r.rrfcaca rcaggagagg 42C 
atggatgttt gcgaaactca tctucactgg cacaccgtcg ccaaagagac atgcagtgag 430 
cagagtacca acttgcatga ctacggcatg ttgctgccct gcggaattga oaacttccga 540 
ggggtagagt ttgtgrg.tg cccactggct gaagaaagtg acaatgtgga ttctgctga. 6CO 
■;cyyagyagg atgactcgga tgtctggtgg ggcggagcag acacagacta tgcagatggg 6CC 
a gtgaagaca aa^tagtaca aytagcagag gaggaagaag tggctcaggt cgaagaagaa 7^1 
gaagccgatg at^a^gagga cgatgaggat ggcgatgagg tagagcaaga ggcrgaggaa 7 8 C 
-cctacgaag aagc=acaga gagaaccacc aqcaltgcca ::accaccac caccaccaca 34 C 
gagtctgtgg aagaggtggf tcgagt tret anaacagcag r.cagt.ac.ccr. r.gatgccgtt 900 
gacaagtatc tcgagacacc tggggatgag aatgaacatg cccatttcca gaaagccaaa 960 
gagaggcttg aggccaagca ccgagagaga atgtcccagg tcatgagaqa aigggaagag 102 0 
gcagaacgtc aagcaaagaa cttgactaaa gctgataaga aggcagttat ccagcatttc 1080 
caggagaaag OggaaLcttt. ggaa-aggaa gcagceaacg agagacagca gctggtggag 1 1*»C- 
acacacatgg r.cagagtgga agccacgctc aatgaccgcc gccgcctggc cctggagaac 12 00 
lacatcaccg ctctgcaggc tgttcctcct cggcctcgtc acgtgttcaa tatgctaaag 12b0 
aaccatgtcc gcgcagaaca gaaggacaga cagsacaccc taaagcattt cgagcatgtg 13 2 0 
cgcatggtgg atcccaagaa agccgctcag atccggtccc aggctatgac acacctccgt 1380 
jtcatt'-aug accgcaogaa LcagL'j lclc LcccLgcUcl acaacgtycc tyuagtggcc 1440 
gaggagattc aggatgaagt tgatgagctg cttcagaaag agcaaaacta ttcagatgac 15 0 3 
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ytcttggcca acatgattag tgaaccaagg at-actta~g yaaacgargc tc^catqcca 15£0 

tcttigacca aaa-gaaaac caccgtagaa ctcc ttcccg -gaatgcaga gttcaqcct} 1620 

yacgatctcc agccgtggca ttctt.tggg gcLgac.ctg Lgccayccaa cacagaaaac 1680 

gaagttgagc ctgttgatyc ccgccctgct gccgac :gag gactgaccac tcgaccaggt 17-10 

tctgggttga caaatatoaa gacggaggag atctctjaag tgaatctgga tgcagaattc 1300 

cgacatgac^ caggaU-ga ag:Lcatcat caaaaattgg tgttctttgc agaagatgtg 1863 

g^ttcaaaca aaqgtqcaat cattggaclc a~ggtgggcg gtgttgtcat agcgacagtg 132 0 

atcgtcatca ccttggtgat gctgaagaag aaacagtaca catccattca tcatggtgtg 198C 

cjtggaggttig acgccgctgt caccccagag yagcgccacc tgtccaagat gcagcagaac 2 04C 

ggctacgaaa atccaaccla caagttctct gagcagatgc agaactag 203S 

-■21C> 12 
-.211:- fi u 5 
•■212> PPiT 

- 2 1 3 > Homo sapiens 
<40Q> 12 

Met Leu Pre Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp TLr Ala Arc 
15 10 15 

Ala Leu Glu Val Pro Thr Asp r.ly Af=n Al^ Gly Leu Levi Ala Gin Pro 

20 25 20 

Gin He Ala Met Phe Cys Gly Arg Leu Asr. Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys ?nr Cys He Asp 

50 S5 60 

Thr Lys Glu Gly He Leu Gin Tyr Cy* Gin Glu Val Tyr Pro Glu Leu 

55 70 75 B0 

27 



WO 00/17369 



PCT/US99/20881 



Glr. lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro Hi.- Phe Val 
100 ICS 110 

:la Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Mot Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 

145 15C 15=5 160 

_,ys Ser Thr Asn Leu His Asp "yr Giy Met Leu Leu Pro Cyrs Gly lie 
16b 170 17b 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pre Leu Ala Glu Glu 
1811 185 190 

■:er Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Eer Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 
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Glu Ala Asp Asp Asp Gtu Asp Anp Clu Asp Cly Asp Clu Val Clu Clu 
245 250 255 

Glu Ala Glu Glu Pre Tyr Glu Glu Ala Thr Glu Arg Thr Thr 5er lie 
263 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Veil Glu Glu Val Val Aru 

275 280 2H5 

Val Pro Thr T'hr Ala Ala Ser Thr Pro Asp Ala Val Asp ^ys Tyr Leu 
290 295 300 

Glu Thr Pre Gly Asp C-lu Asn Glu His A.a His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg G'wi Arg Mpr. Snr Gin Val Mc-t. Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pre Lys Ala Asp 

3 40 345 35 0 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val G^u Thr His Met Ala 

37 0 375 38C 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 

385 39C 395 4C0 

Tyr lie Thr A-a Leu Gin Ala Va. Pro Pro Arg Pro Arg His Val Pr.e 
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Asn Met Leu Lys Lys r iyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 



Thr Leu Lys Mis Phe Glu His Val Arg Mel Val Asp Pro Lys Lys Ala 
435 440 445 



Ala Gin lie Arg Sor Gin Val Met: Thr His Lgu Arg Val Jle Tyr Glu 
45C 455 460 



Arg Met Asr. Glu Sor Leu Ser Leu Lea Tyr A sr. Val Pro Ala Val Ala 
^65 470 47- 4*0 

.;lu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 

500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Sor Leu Thr Glu Thr Lys Thr Thr 

515 52C 525 

Val Glu Lqu Lgu Pro Val Asn Giy Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pre Trp His Ser Phe Gly Ala Asp Ger Val Pro Ala Asn Thr Glu Asn 

54 5 55 0 555 5fcU 

Glu Val Glu Pre Val Asp Ala Arg Pru Ala Ala Asp Arg Gly Leu T:.i 

565 570 575 
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Thr Arg Pro Gly Ser Gly Leu .hr 
580 



Glu Val Asn Ls'i Asp Ala Glu Phe 
535 6C0 



His His Gin Lys Leu Val Phe Phe 
610 615 



Gly Ala lie I la Gly Lou Met Val 

•: 2 1 > 6 3 0 



Asn He Lys Thr Glu Glu lift ser 
535 590 

Arg His Asp Ser Gly Tyr Glu Val 
605 

Ala Glu Asp Val Gly Ser Asn Lys 
620 

Gly Gly Val Val 11 q Ala Thr Val 
635 640 



lie Val He Thr Leu Val Met Leu Lys Lys Ly3 Gin Tyr Thr Ser lie 

645 650 655 

His Hi£ Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 

bbu 665 6/0 



:iis Leu Sex Lys Met Gin Glu Asn Gly Tyr Glu Asa Piu Thr Tyr Lys 
675 680 6flS 

Phe Ph^ Glu Gin Met Gin Asn 

690 695 



^210> 13 
'-211> 2088 
<212> DNA 

<213> Hcnx sapiens 
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<4 c;o> 13 

atyctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
c'_yaacatyc acatqaaLyL ccacjddLqyu 
a-ctgca^g atraeraagga aggcatc c tig 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccaicccca 
yagC-tqtaa gtgatgcccL tcLcgtccct 
atyyatyttt gcgaaactca tcttcactgj 
aagagtanra ant tgna r. ga rfacgg^atg 
ggggragagt ttgtgtgttg cccactggct 
gcggaggagg argactcgga t.grct.ggcgg 
agtgaayaca aactagtaga actagcagag 
yaagccgatg atgacgagga cgatgaggat 
-rcccacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
■jacaayLaLc tcgagacacc Lgyggatgay 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaasgaa cttgcctaaa 
■:aggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccacagtgga agccatgctc 
".acatcaccg ctctgcayyc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
■:gcatggLgy a:cccaagaa agccgc^cag 
gtgatttarg agcgcatgaa tcagiccctc 
gaggagattc aggatgaagt tgatgagctg 
gtcttcgcca acatgattag tgaaccaagg 
tctttgaccg aaacgaaaac caccgLygag 
jacgatctcc agccgtggca ttcttttggg 
jaagttgagc ctgttgatgc ccgccctgcz. 
"ctgggttga caaatatcaa gacggaggag 



gccgcctgga cggctccggc gctggaggca 6C 
gaaccccaga ttgccatgtt ctgtggcaga 120 
daylyygaLl cjagaUuaLc ay ygdccadd ISO 
cagtattgcc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaacrg gtgcaagcgg iOO 
trtgtgattc cctaccgctg ctzagttggt .16 'J 
gacaagtgca aat:c:taca ccaggacagg 4 2 C 
cacaccgtcg ccaaagagac atgcaytgay 480 
ntgctgccct gcggaaf.tga raagf.f.r.cga r >4 0 
gaagaaagtg acaatgtgga ttetgctgat 500 
ggrggagcag acacagacta tgcagatggg obO 
gaggaagaac tgg-j~gaggt ggaagaagaa 72 0 
ggcgatgagg tagaggaaga ggc-gaggaa 760 
agcattgcca ccaccaccac caccaccaca 840 
acaacagcag ccagtacccc tgatgccctt 900 
aaLyaacatq cccdt'.Lccd gaaagccaaa 963 
atgrcccagg tcatigagaga atgggaagag 10 TO 
gctgataaga aggcagttat ccagcatttc IDfcO 
gcagccaacg agagacagca gctggtggag 114 0 
aatgaccgcc gccg~ctggc cctggagaac 120 0 
ccgcctcgtc acgtgttcaa tatgctaaag 12 6 C 
cz.3caca.ccc taaagcattt cgagcatgtg 13J0 
atccggzc.c aggttatgac acacctccgt 1330 
zccczgctct acaacgtgcc tgcagtggcc 14-10 
cttcagaaag agcaaaacta ttcagatgac IS 'JO 
atcagttacg gaaacgatgc tctcatgcca 15G0 
ctccttcccg tgaatggaga gttcagcctg 1520 
gctgactctg tgccagccaa cacagaaaac 1690 
gccgaccgag gaccgaccac tcgaccagg- 1740 
atctcr.gaag tgaagatgga tgcagaattc 18 01 
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cga-jatgact ^agyatalga agrtcatcat cauaaattgy tgttctttgc agaagatgtg 1360 

cjgttcaaaca aaggtgcaat cattggactc atggt.gggcg gtgttgtca- agcgacagtg 192 0 

citcLtcatca ccttggtgat gcrgaagaag aaacactaca catccattca tcatygtg-g 1980 

gtggaggttg acgccgctgt caccccagag gagcgccacc tgtccaagan gcagcagaac 2040 

ygctacgaaa at-caaccta caagrtcttt gagcagatgc agaactag 20BS 

• 2: [)> 14 
<211> 6 95 
- 212> PI\T 

«2i;s> Home sapiens 
-40O 14 

Ket Lou Pre Gly Leu Ala Leu Leu Leu ^eu Ala Ala Trp Thr Ala Arg 
lb 20 IE 

Ala L*u Glu Val Pro Thr Asp Gly Asn Ala Gly Lsu L&u Ala Glu Pro 

20 2j 30 

Gin lie Ala Met ?he Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
3 5 4 0 4 5 

Asn Gly Lys Trp Asp Ser Asp Pre Scr Gly Thr Lys Thr Cys He Asp 

5 2 5 5 6 0 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pru Glu Lau 
fc5 7C 75 SO 

Gin He vhr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 9 0 9 5 

Trp Cys Lys Arg Gly Ary Lys Gin Cys L,ys Thr His Pro His ?he Val 
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ICO 



10b 



110 



lie Pro Tyi Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 

:i: 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
13 0 13 5 14 n 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 

14 j l 5 C 155 1 tjU 

Lys Ser Thr Asn Le. His Asp Tyr Gly Met. Leu Leu Pre Cys Gly lie 
165 170 175 

Asp Lys Phe Arq Gly Val Glu Phe Val cys Cys Pro Leu Ala Glu Glu 
180 185 190 

^er Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp ser Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
j25 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Clu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 ~ 265 270 
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Ala Thr Thr Thr 
275 

Vsl Pre Thr Thr 
290 

Clu Thr Pro Cly 

;^ 05 

Glu Arg Leu Glu 

Glu Trp G^u Glu 
340 

Lys Lys Ala Val 
35S 

Gin Glu Ala Ala 

370 

Arc Val Glu Ala 
385 

'Vyx He Thr Ala 

Asn Met Leu Lys 

420 



Thr Thr Thr Thr 

280 

Ala Ala Ser Thr 
295 

Asp Glu Asn Glu 
3 10 

Ala Lys His Arc; 
3 2 5 

Ala Glu Arg Gin 

He Glr. His ?he 
3 6n 

Asn Glu Arg Gin 
375 

Met Leu Asn Asp 
390 

Leu Gin Ala Val 
405 

Lys Tyr Val Arg 



Glu Ser Val Glu 

Pre Asp Ala Val 
330 

His Ala Hie Phe 
315 

Glu Arg Met Ser 
33 U 

Ala Lys Asn Leu 
345 

Gin Glu Lys Val 

Gin Leu Val Glu 

380 

Arg Arg Arg Leu 
3 95 

Pro Pro Arg Pro 
410 

Ala Glu Gin Lys 

425 



Glu Val Val Arg 
2&5 

Asp Lys Tyr Leu 

Gin Lys Ala Lys 
32C 

Gin Val Met Arg 

335 

Pro Lys Aid Asp 

350 

Glu ser Leu Glu 
365 

Thr his Met Ala 

Ala Leu Glu Asn 
40C 

Arg Kis Val Phe 
41b 

Asp Arg Gin His 
430 
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Thr Le\ Lys His 
43 5 

Aid Gin Ila Arg 
450 

Arg v e t Asn Gin 
Glu Glu Tie Gin 

Tyr Ser Asp Asp 

r j0O 

"V r g1 y Asr - As f 

Val Glu Leu Leu 

53C 

Pro Trp His Ser 
545 

Glu Val Glu Pro 

Thr Artj Pro Gly 

bao 

G'.u Val Lys Met 



Phe Glu His Val 
441 

Ser Gin Val Met 
455 

Ser Leu Ser Leu 
47C 

Asp Glu Val Asp 
435 

Val Lgu Ala Asr. 

Ala Leu Mot Pro 
520 

Pro Val Asn Gly 
535 

Phe Gly Ala Asp 
55C 

Val Asp Ala Arg 
565 

Ser Gly Leu Thr 

Asp Ala Glu Phe 



Arg Met V*l Asp 

Thr His Leu Arg 

460 

Leu Tyr Asn Val 
475 

Glu Leu L*=u Gin 
490 

Met lie Ser Glu 
505 

Ser Leu Thr Glu 

Glu Phe Ser Leu 
540 

Ser Val Pro Ala 

555 

Pro Ala Ala Asp 
570 

Asn lie Lys Thr 
585 

Arg His Asp Ser 



Pro Lys lys Ala 
44 5 

Val He Tyr Glu 

Pre Ala Val A-a 

46G 

Ly? Glu Gin Asn 
495 

Pro Arg lis Ser 
510 

Thr Lys Thr Thr 
525 

Asp Asp Leu Gin 

Asn Thr Glu Asn 
560 

Arg Gly Leu Thr 
575 

Glu Glu He Ser 
590 

Gly Tyr Glu Val 
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595 60C 635 

His His Gin Lys Leu Val ?he The Ala Glu Asp Val Gly 5er Asn Lya 
610 515 620 

Gly Ala lie lie Gly Leu Met Val Gly Gly Val Val lie Ala Thr Val 
62b 633 635 643 

lie Phe He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His H.s Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 66n 670 

His Leu Ser Lys Met Gin Gin Asr. Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 6SC 685 

Pbe Phe Glu Gin Met Gin Asn 
630 635 

-210> 15 
^211> 2094 
•■■212> DNA 

<213> Homo sapiens 
<400> 15 

atgctgcccg gr.r.r.ggcar t_ gctcctgctg gc^gcctgga eggrtegggr getggaggra 6 0 
rccactgatg gtaatgctgg cctgctggct gaaccccaga ttgccatgtt ctgtggcaga 120 
ctgaacatqc acatgaatqt ccaqaatqqq aaatqqqatt caqatccatc aqggaccaaa 18u 
acctgeattg ataccaagga aggcatcctg cagtattgee aagaagtcta ccctcaactg 240 
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ragatca-ca atgtggtaga agccaaccaa 
qgccgcaayc aytgcaagac ccatccccac 
gagtttgtaa gtgatgccct tctcgttcct 
atyyatgLtt ycyaaac tea tcttcactgg 
r^gagharca actf.gcatga etaeggcatg 
giggtagagt ttgtgtgttg cccactggct 
cegqaqgagg atgactcqqa tqtctgqtqq 
j,jtcaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga c gat gag gat 
;cc Lacgaag aagecacaga gagaaccacc 
gagrctgtgg aagaggtggt tcgagttcr.t. 
jacaagtatc tcgagacacc tegggacgag 
gagaggctrg aggncaagca rngagagagn 
qcayaacgtc aaccaaagaa ettgectaaa 
caggagaaag tggaatctit ggaacaggaa 
vicacacatgg ccagagtgga agecatgz-c 
r.acatcacjy utctycaygc tyttcctcct 
aagtatgtrc gegcagaaca gaaggacaga 
■rgcatggtgg atcccaagaa ageegctsag 
gtgatttatg agegcatgaa tcagtctctc 
gaggagat tc aggatgaagt tgatgagctg 
gtcttggc^a acatgattag tgaaccaagg 
tctttcaccg aaa:gaaaa: caccgtggag 
gacgatctcc agccgtggca ttcttttggg 
jaagttgagc ctytLgatgc ccgccctgct 
tctgggttga caaatatcaa gaeggaygag 
rgacatgact caggatatga agttcatcat 
ggttcaaaca aaggtgcaat cattggactc 
ategtcatea ccttggtgat gctgaag^ag 
.jtgqaggttg acgccgctgt caccccagag 
ggctacgaaa atcraaccta caagttcttt 



ccagtgacca tccagaactg gtgeaagegg 300 
tttgtgattc cctaccgctq cttagttggt 3 6 3 
gaeaagtgea aatt-ttaca ccaggagagg 42C 
cacaccgtcg ccaaagagac atgcagtgag 480 
ttgctgccct goggaattga caagttccga 54 0 
gaagaaagtg acaatgtgca ttctgc tgat 600 
qqcqqaqcaq acacaqacta tqcaqatqqq 660 
gaggaagaag tggctgagct ggaagaagaa 72 0 
qgtgatgagy tayaggaaga ggctgaggaa 7 8 0 
ageattgeca ccaccaccac caccaccaca 34 0 
acaacagcag ccagtacccc tgatgccgtt 9 0 n 
aatgaacatg cecal ticca gaaagccaaa 96 0 
atrgtrencagg rratgagaga al-gggaagag 102 0 
gctgataaga aygcagttat ccagcatttc 1030 
gcagccaacg agagacagca gctggtggag 114 0 
aatgaccycc gccgcctggc cctggagaac 1200 
cggcctcgtc acgtgntcaa tatgetaaag 12 6 0 
cagcacaccc laaaycattt cgagcatgtg 132 0 
atocggtccc aggttatgac aca:ctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
ctteagaaag agcaaaacta f. tcagatgac ISO f; 
atcagttacg gaaacgatgc tctcatgcca 1550 
ctccttcccg tgaatggaga qttcagcctg 1620 
gctgactctg tgccagccaa cacagaaaac 1630 
gccgaccgag gactgaccac tcgaccaggt 1740 
atctctgaag tgaagatgga tgeayaatte 18C0 
caaaaattgg tgttctttgc agaagatgtg I860 
atggtgggcg gtgttgtcat agegacagtg 1920 
aaacagtaca catccattca tcatggtgtg 19S0 
gaccgccacc lulccaayai ycagcayaac 2040 
gaecagatge acaacaagaa gtag 2C94 
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-■213> lb 
*-211> 6 9 7 

<;U3> Hcir.o sapiens 
<4:o> It 

Met Leu Pre Gly Lou Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 

1 5 10 IS 

Ala Leu G_u Va_ Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala C-lu Pro 

2 U 2 5 3 0 

Gin Lis Ala Met Phe Cys Gly Arg Leu Asn KgL His Met Asn Val Gin 
jb 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Sor Gly Tin Lyt Tin cys lie Asp 

50 55 6C 

Thr Lys Glu Gly He Leu Gin Tyr cys Gin Glu Val Tyr Pro Glu Leu 
6S 7C 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
£5 90 S5 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His ?ro His Phe Val 
100 105 110 

:1q Pro Tyr Arg Cys Leu Va_ Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 
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Va.1 Pro A^t-- Lys 
130 

Glu Thr His Leu 
145 

Lys Ser Thr Asn 

Asp Ly.? Ph* Arrj 
180 

^er Asp Asn Val 
195 

Trp Trp Gly Gly 
21 0 

Val Val Glu Val 
225 

Glu Ala Asp Asp 

Glu Ala Glu Glu 

260 

Ala Thr Thr Thr 
275 

Val Pro Thr Thr 



Cys Lys Phe Leu 
135 

His Trp His Thr 

150 

Leu Mis Asp Tyr 
165 

Sly Val Glu ?ho 

Asp Ser Ala Asp 

200 

Ala Asp Thr Asp 
215 

Ala Glu Glu Glu 
230 

Asp Glu Asp Asp 
245 

Pro Tyi Glu Glu 

Thr Thr Thr Thr 
280 

Ala Ala Ser Thr 



His Gin Glu Aiq 
140 

Val Ala Lys Glu 
155 

Gly Met Leu L^u 
170 

Val cys cys Pro 
185 

Ala Glu Glu Asp 

Tyr Ala Asp Gly 

220 

Glu Val Ala Glu 
235 

Glu Asp Gly Asp 

2 50 

Ala Thr Glu Arg 

265 

Glu Ser Val Glu 
Pro Asp Ala Val 



Met A;>p Veil Cy* 

Thr Cys Ser Glu 
160 

Pro Cys Gly He 
175 

Leu Ala Glu Glu 
190 

Asp Ser Asp Val 

2 05 

Ser Glu Asp Lys 

Val Glu Glu Glu 
240 

Glu Val Glu Glu 
255 

Thr Thr Ser lie 

270 

Giu Val Val Arg 
285 

Asp Lys Tyr Leu 
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290 



300 



Glu Thr Pre Gly Asp Glu As:i Glu His Ala His Phe Gin Lys Ala Lys 

305 31C 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Ket 3er Glr. Val Ket Arg 

32S 33 0 33 5 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 34b 350 

Lys Lys Ala Val He Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

170 375 330 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr He Thr Ala Leu Gin Ala Val Pro Pre Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pre Lys Lys Ala 
435 440 445 



Ala Gin He Arg Ser Glr. Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 
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Arq Met Asn Gin 
465 

Glu Glu TIP Gin 

Tyr Ser Asp Asp 

SCO 

Tyr Gly Ar.n Asp 
515 

Val Glu Leu Leu 

Pro Trp His Set 
54 5 

Glu Val Glu Pro 

Thr Arcr Pro Gly 
580 

Glu Val Lys Met 
595 

His His Gin Lys 
61C 



Ser Leu Sar Leu 
47 0 

Asp Glu Val Asp 
485 

Val Leu Ala Asn 

Ala Leu Met Pro 
520 

Pro Val Asn Gly 

Phe Gly Aid. Asp 
550 

val Asp Ala Arg 
565 

Ser Gly Leu Thr 

Asp Ala Glu Phe 
600 

Lsu Val Pho PHq 
615 



Leu Tyr Asn Val 
475 

Glu Leu Leu Gin 
490 

Met lie Ser Glu 

5 05 

Ser Leu Thr Glu 

Glu Phe 3er Leu 
540 

Sex Val Pru Ala 
555 

Pre Ala Ala Asp 
57 0 

As:: 11g Lys Thr 
= 85 

Arg His Asp Ser 

Ala Glu Asp Val 
62C 



Pro Ala Val Ala 
480 

Lys Glu Gin Asn 
495 

Pre Arg lie Sor 

5 1 0 

Thr Lys Thr Thr 

525 

Asp Asp Leu Gin 

Asn Thr Glu Kan 

560 

Arg Gly Leu Thr 
575 

Glu Glu He Ser 
E90 

Gly Tyr Glu Val 
505 

Gly Ser Asn Lys 
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Gly Ala llu lie Gly Lea Met Val Sly Gly Veil Val He Ala Thr Val 
h2^j 630 63b 64 1: 

He Vai Tie Thr Leu Val Met Leu Lyy Lys Lys Gin Tyr Thr Ser Ha 
645 6 5 0 fiSE 

His His Gly Val Val G_u Val Asp Ala A^a Vai Tnr Pro Glu Glu Arg 
660 665 670 

His Lpu .^r Lys Mef Gin Gin Asr. Gly Tyr Glu Asn Pro Thr Tyr Lys 
67E 68C 6£5 

?Kg Phe Glu. Gin Met: Gin Asn Lys Lys 
6 9 0 6 9 5 

*210> 17 

«./ll> 2 094 

<212> DNA 

«:213> Ho-t.o sapisr.s 



v4C0> 17 



atgctgcccg 


gtttggcac t 


gctcctgotg 


gccgcctgga 


cggctcgggc 


gctggaggta 


60 


rccactgatg 


gtaangctgg 


cctg-tggct 


gaaccccaga 


ttgccatgi: t 


ccgrggcaga 


120 


ctgaacatgc 


acatgaatqt 


ccagaatggg 


aagtgggatt 


cagatccatc 


agygaccaaa 


180 


acctgcat -g 


^ta^caagga 


aggcat cctg 


cagtat tgee 


aagaagtc ta 


ccctgaactg 


243 


cacat caeca 


atgtggtaga 


agccaac-:aa 


ccagtgacca 


tccagaactg 


gtgeaagegg 


300 


ggccgcaagc 


agtjcaagac 


ccatccc :ac 


tttgtgattc 


cctaccgctg 


ctcagttggt 


36C 


gagtttgtaa 


gtgatgeect 


tctcgtccct 


gaeaagtgea 


aattcttaca 


ccaggagagg 


420 


atggatgttt 


gegaaactea 


tcttcactgg 


cacarcgtcg 


ccaaagagac 


atgcagtgag 


4SC 


aagagtacca 


ac ttgcatga 


etaeggcatg 


t tgcfcgccct 


geggaattea 


caagttccga 


540 
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ggcgtagagt ttgtgtgttg cccactggct 
4-cyaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
get&yjcya^y aLyacyayya uyatyayyeit 
r rctacgaay aayccacaga cagaa^cacc 
gagtctgtgg aagaggtggt tcgagttccfc 
qacaagtatc tcgagacacc tggggatgag 
yagaggc tLg aygccaayca ccyagagaya 
g^agaacgcc aagcaaagaa cttgcc taaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacalgg ccagagtgga agccatgctc 
•.acatcac-g ctcrgcaggr, tg/ttcctcct 
aagtatgtcc gccragaaca gaaggacaga 
cgcatggtgg atcccaagaa acccgctcag 
gtgatttatg agcycatgaa tcagtctctc 
yaggagattc aggatgaagt tgatgagctg 
jtcttyyccjd acalyaLlay Lyaaccaayg 
-ctttgaccg aaacgaaaac caccgtggag 
gacgatctcc agccytgyca ttcttttggg 
gaagttgagc ctgttgatgc cccccctgct 
rctgggttga caaatatcaa gacggaggag 
cgacatgact cagcatatga agttcatcat 
ggttcaaaca aaggtgcaat: cattggactc 
a:cgtcatca cctcggtgai gctgaagaag 
g^ggaggttg acgccqctgt: caccccagag 
.;gctacgaa-d atccaaccta caagttxt t 

<210> lb 

*.211> 6 97 

<212> PRT 

<213> Homo sapiens 



gaagaaagtg acaatgtgga ttctgctgat 600 
ggcggaycay acacagacta tgcagatggg 66 0 
gaggaagaag tggctgaggt ggaagaagaa 72 0 
gytyatgayg tayaygaaya ygctgayyaa 760 
agcartgcca ccaccaccac naocancana 84 0 
acaacagcag ccagtacccc tgatgccgtt 91) 0 
aatgaacatg cccatttcca qaaagccaaa 9 60 
atgteecagg tcatgagaga atgggaagag 1020 
gctgataaga aggeagttat crag-atttc 1080 
gcagccaacg agagacagca g~tg-jtggag 11.40 
aatgaccgcc gccgcctgcc cctggagaac 1200 
cggczzzgtc acgtgttcaa tatgc taaag 126 0 
cagcacaccc taaagcattt cgagcatgtg 132 C 
atccggtccc aggtnatgac acacctccgt 1380 
tccctgctct acaacgtgcc tgcagtggcc 1440 
cttcagaaag agcaaaacta ttcagatcac 1500 
ateayL^acy yaaacyatgc tctcatycca 15 6 0 
ctccttcccg tgaatggaga gttcagcctg 1620 
cctgactctg tgccagccaa cacagaaaac 163 0 
gccgaccgag gacrgaccac tcgaccaggc 1740 
atctctgaag tgaatctgga tgcagaattc 1800 
caaaaattgg tgttctttgc agaagatgtg 1360 
atggtgggcg gtgttgtcat agcgacagtg 1920 
aaacagtaca catccattca tcatggtgtg 1980 
gagcgccacc tgtccaagaL gcagcagaac 204 0 
gagcagatgc agaacaagaa gtag 20 94 



44 



WO 00/17369 



PCT/US99/2088I 



<400> 18 

Met Lou Pro Gly lou Ala Lou Leu Lou Leu Ala Ala Trp Thr Ale Arg 
15 10 IE 

Ala Leu Glu Val Pre Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 2b 30 

nin lie Ala Met ?he Cys Gly Arg Leu Asm Met His Met. Asn Val Gin 
1<3 40 45 

Asn Gly Lys Trp Asp Ser Asp Pre ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly Ilo Lgu GLn Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
6 5 7 0 75 SO 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pre His ?h« Val 
100 105 110 

Il« Pro Tyr Arg Cys Leu Val Gly Glu Phc Val S^r Asp Ala Leu Lgu 
115 12C 125 

Val Pro Asp Lys Cys Lys Phs Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr cys ^er Glu 
145 " ISO 155 160 
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Lys Ser Thi Asn 

nsp Lys Pho Aru 
150 

Ser Asp Asn Val 

?rp Trp Gly Gly 
210 

Val Val Glu Va 1 
225 

Glu Ala Asp Asp 

Glu Ala Glu Glu 

260 

Ala Thr Thr Thr 
275 

Val Pro Thr Thr 
290 

Giu Thr Pro Gly 
30S 



Leu His Asp Tyr 
165 

Gly Val Glu Phe 

Asp Ser Ala Asp 

200 

Ala Asp r lhr Asp 
215 

Ala Glu Glu Glu 

230 

Asp Glu Asp Asp 
245 

Pro Tyr Glu Glu 

Thr Thr Thr Thr 

2 30 

Ala Ala Ser Thr 

295 

Asp Glu Asn Glu 
310 



Gly Mat heu Leu 
170 

Val cys cys Pro 
185 

Ala Glu Glu Asp 

Tyr Ala Asp Gly 
22C 

Glu Val Ala Glu 
235 

Glu Asp Gly Asp 
250 

Ala Thr Glu Arg 

265 

Glu Ser Val Glu 

Pre Asp Ala Val 

300 

His Ala His Phe 

315 



Pro Cys Gly He 
175 

Leu Ala Glu Glu 
190 

Asp Ser Asp Val 
2C5 

Ser Glu Asp Lys 

Val Glu Glu Glu 
240 

Glu Val Glu Glu 
255 

Thr Thr Ser lie 

27 0 

Glu Val Vdl Arg 
235 

Asp Lys Tyr Leu 

Gin Lys Ala Lys 

320 
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Glu Ai lj Luu. Glu 

GlU Trp GlU GiU 
34C 

Lys Lys Ala Val 

3 55 

Glr. Glu Ala Ala 
370 

Arc Val Glu Ala 

Tyr lie Thr Ala 

Asn Met Leu Lys 
420 

Thr Leu Lys His 
435 

Ala Glr. lie Arg 
450 

Arg Met Asn Glr 
465 

:Uu Glu lie Gin 



Ala Lys His Aig 
32b 

Ala Glu Arg Gin 

He Gin His Phe 

3 6C 

Asn Glu Arg Gin 
375 

Met Leu Asn Asp 
39C 

Leu Cln Ala Val 
405 

Lys Tyr Val Arg 

Phe Glu His Val 
440 

Ser Gin Val Met 
455 

Ser Leu Ser Leu 
470 

Asp Glu Val Asp 



Glu Arg Net Sex 
330 

Ala Lys Asn Leu 
345 

Gin Glu Lys Val 

Gin Lev; Val Glu 
380 

Arg Arg Arg Leu 
395 

Pro Pro Arg Pro 
410 

Ala Glu Gin Lys 
425 

Arg Met Val Asp 

Thr His Leu Arg 
460 

Leu Tyr Asn Val 
475 

Glu Leu Leu Gin 



Gin Val Met Arq 

335 

Pre Lys Ala Asp 

3 5 3 

Glu Ser Leu Glu 

j 65 

Thr His Met Ala 

Ala Leu Glu Asn 
400 

Arg Kis Val Phe 
415 

Asp Arg Gin His 
430 

Pro Lys Lys Ala 
445 

Val He Tyr Glu 

Pro Ala Val Ala 
480 

Lys Glu Gin Asn 
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435 



49C 



lyr Ser Asp Asp Val Leu Ala Asn Xc: Ila Ser Glu Pro Arg He Ser 
500 505 510 



lyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 

5L5 52C 525 

Val Glu Lieu Leu Pro Val Aeii Gly Glu Phe Ser Leu Asp Asp Leu Gin 

530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 

c ,4 r , 550 56 0 

^lu val Glu Pre Val Asp Ala Arg pre Ala Ala Asp Arg Gly Leu Thr 
565 570 575 



Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu lie Ser 

^80 585 59C 

Glu Val Asn L<?u A,<=p Ala Glu Php Arg His Asp Ser Gly Tyr Glu Val 

595 600 60S 

His His Gin Lys Leu Val Phe Phs Ala Glu Asp Val Gly Ser Asn Lys 

610 615 620 

7,ly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 

625 63C 635 640 

lie Vctl He Thr Leu Val Me:: Leu Lyy Lyt; Lys Glu Tyr Thr Ser He 
64 5 650 65 5 
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H:s His Gly Val Val Giu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 67 0 



His Leu Ser Lys Met Gin Gin Asm Gly Tyr Glu Asn Pro Thr Tyr Lys 
67 S 630 595 



rhc PhG Glu Gin Met Gin Asn Lys Lys 
690 695 



*:21U> iy 

<211> 2094 

<212> DNA 

<213> Horr.o sapiens 



<400> 19 

atgctgcccg gtttggcact gctcctgctg cccgcctgga cggctcgggc gctggaggta 60 
cccactgatg gtaargctgg cctgctggct gaac^ccaga ttgcsatgtt ctgtggcaga 120 
ctgaacatqc acatqaatg'; ccagaaLqgg aaqtgqgatt cagacccatc agggaccaaa 18 J 
acctgeattg ataccaagga aggcatcctg cagtattgee aagaagtcta ccctgaactg 24 0 
■zagatcacca atgtggtaga agccaaccaa ccagtgacca tccagaactg gtgeaagegg 300 
ggccgcaagc agtgcaagac ccatccccac tttgtgantc cctaccgctg cttagttggt 360 
gagtttgtaa gtgatgece^ tetegttset gaeaagtgea aattct:aca ccaggagagg 4;:0 
drggatgttt gegaaactea tcttcactgg cacaccgtcg ccaaagagac atgcagtgag 480 
aagagtacca acttgeatga etaeggcatg ttgctgccct gcggaa~tga caagttccga 540 
ggggtagagt ^tgtgtgttg cccactggct gaagaaagtg acaatgegga ttctgetgat 60C 
geggaggagg atgactcgga tgtctggtgg ggcggageag acacagacta tgcagatggg 6nC 
agtgaagaca aagtagtaga agtagcagag gaggaagaag tggctgaggt ggaagaagaa 720 
gaagccgatg atgacgagga cgatgaggat ggtgatgagg tagaqgaaga ggctgaggaa 780 
:cctacgaag aagecacaga gagaaccacc accattgcca ccaccaccac caccac:aca S40 
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gag^ctgcgg aagaggtggt tcgagttcct acaacagcag ccagtacccc tgatgccgtt DOC 

tra^aagtatr rrg.igar.arr f-gggga "gag aa r.gap. na" g rr:rartrrra ganngc~aaa 3 60 

gagaggctty eiggc^aagca ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 

ucagaacgtc aagcaaagaa ctcgcctaaa gccqaLaaga aggcagttac ccagcatttc IC80 

■rraggayaaag tgyaatcttt gyaacaygaa gcaqccaacy agagacag:a gctggtggag 114? 

T.cacacatgy ccagagtgga agccatgctc aatgaccgcc gccgcctggc cctgyayaac 120V, 

•.a:atcaccg r tctgcaggo tgttcctcct r-ggcctcgtr ar.gtgttsan r.at.gc t.aaag i:>h[; 

aagtatgtcc gcg-agaaca gaaygacaga cagcacaccc taaagcattt cgagcatg.g 132C 

cgcatggtgg atcccaagaa agccg^tcag at.ccggtccc aggttatgac acacctc-gt 1380 

jLgacttatg agcgcargaa tcagtctctc Lccctgctc: acaacgtgcc tgcagtggcc 1440 

^aggagat t_c agyatyaayt tgazgagctg cticagaaag agcaaaacta ttcagatgac 1500 

-jtcttggcca acatgattag tgaaccaagg atcagttacg gaaacgatgc rctcatgcca 1S60 

tctttyaccg aaacgaaaac caccgtggag ctccttcccg tgaatggaga gttcagccty 1620 

gacyaLctcc agccgtggca ttcttttggg g = Lgact.ctg tgccagccaa cacagaaaac 1680 

gaagttgagc ctgtLgalgc ccgccctgct gccgaccgag gactgaccac tcgaccaggt 1740 

Lctgggttga caaatatcaa gacggaggag atctictgaag tgaagatgga tgcagaattc IS 00 

■cgacatgacL caggazataa agtccatcat caaaaatryg tgttctutgc agaagatgty 166C 
rgttcaaaca aaggtgzaat cattggactc a-ggtgggcg gtgttgteat agcgacagcg IS2C 
c.tcctcatca ccttgy-gat gctgaagaag aaacagtaca catccattca ~catggtgtg 1S£0 

ctggaggttg acgccgctgt caccccagag gagcgccacc -gtccaagat gcagcagaac 2040 

cgcta:gaaa atccaaccu caagttcttt gagcagatgc agaacaagaa gtag 2034 

-?10> 20 
<.211> 697 
<?12> PRT 

<213> Hor.o sapiens 
<400> 20 

Met Leu Pre Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
IS 10 15 
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Ala Leu Glu Val Pre Thr Asp Gly Asn Ala Gly Leu Leu Aid Glu Pro 

20 25 3 0 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Ker. Asn Val Gin 
3S 4 0 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Tnr Lys Thr Cys He Asp 
50 55 60 

'iTir Lys Glu Gly Hp Leu Gin Tyr Cys Glr. Glu Val Tyr Pro G^u Leu 
65 70 75 £0 

Gl;: Hg Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
8 5 9 0 9 5 

'Trp Cys Lys Arg Gly Arg Lys Gin cys Lys Thr His Pro His Pho Val 
103 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Pho Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Pho Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Cly Met Leu Leu Pro Cys Gly He 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Lgu Ala Glu Glu 
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1B0 135 1^0 

Ker Asp A?n Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
125 2 CO 205 

Trp Trp Gly Gly Aia Asp Thr Asp Tyr Ala Asp GLy Ger Glu Asp Lys 
210 215 2 2 0 

Val Val Glu Val Ala Glu GLu Glu Glu Val Ala GLu Val Glu Glu Glu 
125 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu va i c;lu Glu 
245 25C 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
263 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Scr Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 3C0 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 

3 05 31C 315 32 0 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
32b 330 33d 

Glu Trp Glu Glu Aia Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 

340 345 350 
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Lys Lys Ala Val 

355 

Gin Glu Ala Ala 

3 7 0 

Arg Val Glu Ala 

3 85 

Tyr lie Thr Ala 

Asn Mst Leu Lys 
420 

Thr Lea Lyi His 

43 5 

Ala GJ n lie Arg 
450 

Arg Met Asn Gin 
465 

Glu Glu lie? Gin 

Tyr 3er Asp Asp 
500 



_le Gin His Phe 

: s 60 

Asn Glu Ary Gin 
37 5 

Met Lou Asn Asp 

39 0 

Leu Gin Ala Val 

4 C 5 

Lys Tyr Val Arc 

?he Glu His Val 
440 

Ser Gin Val Ket 
455 

Ser Leu Ser Leu 
470 

Asp Glu Val Asp 
485 

Val Leu Ala Asn 



Gin Glu Lys Val 

21 n Leu Val Glu 
380 

Arg Arg Arg Leu 
395 

PrD Tro Arg Pro 
410 

Ala Glu Gin Lys 

Ary Met Val Asp 

Thr His Leu Arg 
460 

Leu Tyr Asn Val 
475 

Glu Leu Leu Gin 
43C 

Kec lie Ser Glu 
505 



L-lu Ser Lea Glu 

3 65 

Thr His Met Ala 

Ala Leu Glu Asn 
400 

Arg His Val Phe 
415 

Asp Arg Gin His 
430 

Pic Lys Lys Ala 
445 

Val lie Tyr C-iu 

Pro Ala Val Ala 
480 

Lys Glu Gin Asn 
495 

Pro Arg I 1 e Ser 
510 
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•lyr Gly Asn Asp Ala Leu Met ?rc Ser Leu Thr Glu Thr Lys Thr Thr 
bib 520 525 

Val Glu Luu Luu Pr'j Veil As:. Gly Glu Phe S«r Leu Asp Asp Leu Gin 

^3 0 53 5 54 0 

Pre i'rp His Sor phe Gly Ala Asp Ssr val Pre Ala Asn Thr Glu Asn 
u 45 550 555 560 

Glu Val Glu Pre Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 576 

"hr Arg Pro Gly 3pr Gly L*u Thr Asn lie Lys Thr Glu Glu lie Ser 
53C 505 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ssr Gly Tyr Glu Val 
595 60 0 605 

His His Gin Lys Leu Val The Phe Ala Glu Asp Val Gly Ser Asn Lys 

610 615 62 0 

Gly Ala lie lie Gly Leu Met Val Gly Gly Val Val lie Ala Thr Val 
t25 630 635 640 

lie Phe lie Thr Leu Val Met: Leu Lys Lys Lys Gin Tyr Thr Ser lie 
64 5 650 655 

His His G^y Val Val Giu Val Asp Ala Ala Vai Thr Pro Giu Glu Arg 

660 665 67C 

Hip Leu 3er Lys Me!: Gin Gin Asn Gly Tyr Glu Asn ?rr> Thr Tyr Lys 
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G7^ 



Pha t'he Glu Gin Met Gin Asn Lys Lys 
690 695 



^210> 21 

<::i> 1341 
-::i2- dna 

* 213- Home sapiens 
<400 > 21 

.irgg^tagoa trgacrggfcgg aragcaaatrg 
;cgc:cctec gcagcggcct ggggggegee 
sacg.iagagc ccgaggagcc eggceggagg 
^g^caagt eggggcaggg etactaegtg 
-tca.icatcc tygtggatctc ayycagcagt 
-.tcc'.gcatc gctactacca gaggcagctg 
gtgt-atgtgc cctacaccca gggcaagtgg 
atcc-ccatg gccccaacgt cacrgtgcgt 
aagt-cttca tcaaeggctc caactgggaa 
gccaggcctg acgactccct ggagecttte 
■:ecaacctct tctccctgca cctttgtggt 
ctggcctctg teggagggag catgatcatt 
ay t^Lcicgt atacacccat ceggegggag 
gagatcaatg gacaggatct gaaaatggac 
gtggacagtg gcaccaccaa ecttegttty 
tccatcaagg cagcctcctc caeggagaag 
ctggtgtigct ggcaagcagg cacsacccct 
■jtaaLgggLy agyttaccaa ccaytcct-c 
cggccagtgg aagatgtggc cacgtcccaa 



ggrcgeggat - rr , acrcag*'a cggcatccgg 60 
cccctggggc tgcggctgcc ccgggagacc 120 
ggcagctttg tggagatggt ggacaacctg 180 
gagatgaccg tgggcagccc cccgcagacg 240 
aac:t:ycag igggtgctgc cccccacecc 3 00 
tccagcacat acegggac-t cccgaagggt 3 6C 
gaaggggagc tgggcaccga cctggtaagc 420 
gecaacattg ctgccatcac tgaatcagac 460 
ggcatcctgg ggctcgccta tgctgagatt 540 
tttgact etc tggtaaagca gacccacgt: 600 
gciggcttcc ccctcaacca gtctgaagtg 660 
ggaggtatcg accactcgct gtacacaggc 72 0 
tggtattatg aggtcatcat tgnccyggtg 78C 
tgcaaggagt acaactatga caagagcatt 84C 
cccaagaaay tgtttgaagc tgcagtcaaa 9 0 0 
ttccctgatg grttctggct aggagagcag 9 61 
tggaacattt t:ccagtcat ctcactctac I02C 
cgcaLcacca '.cc l Lccgca ycaatacclg 1080 
gacgactgtt acaagtttgc catctcacag 1140 
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t = at = cacgg gcactgtrat gggsgctg^t atcatggagg gcttxtacgt tgt^tttgar. 120C 
cjgaeceyaa aacgaaLLyg ct-tyccgtc agcgcttqcc atgtgcacga tgaattcagg 1260 
asggcagcgg tggasggccc ttttgtcasc rtggacatgg aagactgtgg ccacaacatt 132C 
cracagacag atgagLcaty a 1341 

-21()> 22 
<2i:> 446 
-212> PPT 

- 2 1 :i : ' K ■ mt.o sapiens 

<4 0 0> 22 

y.et Ala sor xct Thr G]y Gly Gin Gin Kc L Gly Arg Gly Ser Thr Gin 
- 5 m IS 

•lis lily lie Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu 
20 25 30 

Gly Leu Arg Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pre 3ly 
35 40 45 

Arg Arg Gly Ser ?he Val Glu Met Val ksv Asn Leu Arg Gly Lys Ser 
5 0 5b 6 0 

Gly Sin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pro Pro Gin Thr 

65 70 75 80 

Leu Asn lie Leu Val Asp Thr Gly Se: Ser Asn Phe Ala Val Gly Ala 

8 1 y ^ S b 

Aid Pro His Pro Phe Leu His Ary Tyr Tyr Gin Arg Gin Lsu Ser Ser 

100 105 110 
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Thr Tyr. Ar;; Asp Leu Arg Lys Gly Val Tyr Va 1 Pro Tyr Thr Gin Gly 
115 120 125 

Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val Ser I In Pre His Gly 
130 125 140 

7*r d Asn Val Thr Val Arg Ala Asn lie Ala Ala lie Thr Glu Ser Asp 
145 150 155 160 

i ys ?he Phe lie Asn Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala 
165 17U 175 

Tyr Ala Glu He Aid Aiy Pro Asp Asp Ser Leu Glu Pre Phe The Asp 

:8n 135 iyo 

Ser Leu Val Lys Gin Thr His Val Pro Asr. Leu Phe Ser Leu His Leu 
195 200 205 

Cys Gly Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val 

210 215 220 

Gly Gly Ser Met: T;g Tie Gly Gly He Asp His Ser Leu Tyr Thr Gly 
.25 230 235 240 

Scr Leu Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He 
245 25C 255 

He Val Arc Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys 

26U 265 270 
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Clu 'IV r A- r - Ty r 
275 

Arg Leu Pro Lys 

2 'J C 

Aid Sue Ser Thr 

:'05 

Leu val Cys Trp 

: le fieir Leu Tyr 
340 

Thr Tie r.ou Frn 
355 

Gin Asp Asp 
37 0 

Thr Val Met Gly 
385 

Arg Ala Arg Lys 

Asp Glu Phe Arg 
42C 

Met Glu Asp Cys 



Anp Lyn Sor 11= 
28 C 

Lys Val Phe Glu 
29b 

Glu Lys Phe Pro 

31C 

Gin Ala Gly 'i'hr 
Leu Met Gly Glu 
O In Glr, Tyr 

360 

Zys Tyr Lys Phe 

37 5 

Ala Val He Xet 

3 9 0 

Arg He Gly Phe 
405 

Thr Ala Ala Val 
Gly Tyr Asn lie 



Val Asp Sor Gly 

Ala Ala Val Lys 

330 

Asp Gly Phe Trp 

31S 

Thr . J ro Trp Asn 
33 0 

Val Thr Asn 3 In 
345 

Arg Pre Val Ghi 

Ala lie Ser Gin 

3 80 

Glu Gly Phe Tyr 

3 95 

Ala Val Ser Ala 
41 J 

Glu Gly Pre Phs 
425 

Pro Gin Thr Asp 



Thr Thr Asn Lou 
285 

Ser 11 g Lys Ala 

Lgu Gly Glu Glr. 

3 20 

1 le Phe Pro Val 
335 

Ser Phe Ary lie 
350 

Asp Vni Ala Thr 
365 

Ser Ser Thr Gly 

Val Val Phe Asp 
4C0 

Cys His Val His 
41b 

Val Thr Leu Asp 

43 0 

Glu Ser 
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435 440 445 

<210> 22 
<212 • DNA 

<213> Homo sapiens 



<400 - 22 

atggrtagca tgactggtgg acagcaaatg 
jjyc ^tgaac aggacggatc cacccagcac 
gyyggcgccc ccctggggct gcggctgccc 
ggccggnggg g=agctr.tgt ggagatggtg 
tactacgtgg agatgaccgt gggcagcccc 
ggcagcagta act^tgcagt gggtgctgcc 
agg.agctgt ccagcacata ccgggacctc 
gjcaagtggg aagyggagct gggcaccgac 
actg^gcgtg ccaacattgc tgccatcact 
aacngggaag gcarrcctcgg gcigqccz&z 
gaycctLLct ttgac Iclct. ggiaaaycay 
c:tttgr.ggtg ctgcjr.tt.ccc. ncr.caaccag 
atgatcattg gaggtiatcga ccactcgctg 
eggcgggagt ggtatlatqa ggtcatcatt 
aaaatggact gcaaggacta caactatigac 
cttcgtttgc ccaagaaagt gtttgaagct 
acggagaagt tccctgatgg tttctggcta 
accacccctt ggaacatttt cccagtcatc 
caqtcct tec gcatcaccat cct~ccgcaq 
acgtcccaag acgactgtta caagtttgcc 
ggagctytta ^catyyayyu ettctacgtt 
tttgctgtca gcgct-gcca tgtgcacgat 



ggtcgcggat cgatgactat. ctctgacrct fcU 
ggcatcc^gc Lgcccctgcg cagcygccty 12C 
cyyyayaccy acyaayaycc cyayyagccc 18 C 
gacaacc^ga ggggcaagtc ggggcagggc 240 
ccgcugacgc tcaacat^ct ggtggataca 300 
ccccacccct tcctqcatcg ctactaccag 360 
cggaagggtg tgtatgtgrc ctacacceag 420 
ctggraagca tcccccatgg ccccaacg:c 480 
gaatcagaca agt tcttcar caacggctcc 540 
gctgagattg ccaggcctga cgactccctg 600 
acccacyttc ccaacctctt ctccctgcac 660 
tctgaagtgc tggcctctgt cygagggagc 72C 
tacacaggca gtctctggta tacacccatc 780 
gcgcgggtgg agatcaatgg acaggatctg 840 
aagagcatitg tgqacagtgg caccaccaac 900 
gcagtcaaat ccatcaaggc agcctcctcc 960 
ggagagcagc tggtgtgctg g=aagcaggc 102 0 
tcactctacc taatgggtga ggttaccaac 10£0 
caatacctgc qqccactqqa ayatgtqgcc 1140 
atctcacagt catccacggg cactgttatg 1200 
gtctttgatc gggcccgaaa acgaattggc 12 6 0 
gagttcagga cggcagcggt ggaaggccct 1320 
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tttgtcacct tggacatgga agactgtggc ta-aacattc cacagscaga tgagtcatga 12 SC 

< ::10> 2 4 
<:il> 453 

«. :a2> prt 

«..!13> Homo sapiens 
.-.:cc> 24 

v^t Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met: Thr 
: 5 10 15 

He Ser Asp ser Pro Arg Glu Gin Asp Gly Ser Thr Gin His Gly lie 

2 0 2-. ™ 

Arg Leu Pre Leu Arg Ser Gly Leu Gly Gly A. a Pre Leu Gly Leu Arg 
35 40 45 

Leu Pro Arg Glu Thr Asp Glu Glu Pre Glu Glu Pro Gly Arg Arg Gly 

SO 55 6C 

.q Pr p ne Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser Giy Gin Gly 
65 70 75 80 

Tyr Tyr Val Glu Mat Thr Val Gly Ser Pro Pro Gin Thr Leu Asn lie 
85 90 95 

Lea Val Asp Thr Gly Ser Ser Asn Phe Ala Va^ Gly Ala Ala Pro His 
100 105 110 

Pio Phe Le_: His Arg Tyr Tyr Gin Aiy Gin Leu Ser Ser Thr Tyr Ary 
115 120 12 5 
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Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu 
13 J 13 5 140 

Gly Glu Leu Gly Thr Asp L^u Val Ser H* Pro His Gly Pro Asr. Val 
145 150 155 100 

Thr Val Arg Ala Asn He Ala Ala lie Thr Glu Ser Asp Lys Phe Phe 
165 17 0 17 5 

:le Asn Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu 
180 18b 190 

lie Ala Kic Pro Asp Asp Ser Leu Glu Pre Phe Phe Asp Ser Leu Val 

; 9 5 2 0 0 2 0 E 

Lys Gin Thr His Val Pre A=;n Leu Phe Ser Leu Kis Leu Cys Gly Ala 
210 215 223 

■Jly Phe Pro Leu Asn Glr. Ser Glu val Leu Ala Ser Val Gly Gly Ser 
;25 23C 235 240 

Met He lie Gly Gly lie Asp His Ser Leu Tyr Thr Gly Ser Lou Trp 
245 250 255 

Tyr Thr Pro He Arg Arc Glu Trp Tyr Tyr Glu Val Ho He Val Arg 
260 265 270 

Val Glu He Acn Gly Gin Asp Lou Lys Mot Asp Cys Lys Glu Tyr Asn 
275 280 285 
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Tyr Asp Lys Ser 
290 

L/x Lys Val Phe 

Thr Glu Lys Phe 

Trp Gin Ala Gly 
34C 

Tyr Lpu Met Gly 
355 

Pre Gin Gin Tyr 
370 

Asp Cys Tyr Ly:s 
185 

Gly Ala Val lie 

Lys Arg lie Gly 
42 0 

Arg Thr Ala Ala 
435 



He Val Asp Ser 
295 

Glu Aid Aid Vdl 
310 

Pro Asp Gly Phe 
325 

Thr Thr Pro Trp 

Glu Val Thr Asn 

3G0 

Leu Arg Pro Val 
375 

Phe Al a He Ser 
390 

Met Glu Gly Phe 
405 

Phe Ala Val Ser 

Val Glu Gly Pro 
440 



Gly Thr Thr Asn 

300 

Lys Sei He Lys 
31S 

Trp leu Gly Glu 

33C 

Asp. He Phe Pro 
345 

Gin Ser ?p.p Arg 

Glu Asp Val Ala 

380 

Gin Ser Ser Thr 
395 

Tyr Val Val Phe 
410 

Ala Cys His Val 
425 

Phe Val Thr Leu 



Leu Arg Leu Pro 

Ala Ala Ser Ser 

Gin Leu Val Cys 
3 3 5 

Val He Ser Leu 
350 

Tie Thr Tie Leu 
365 

Thr Ser Gin Asp 

Gly Thr Val Met 
4C0 

Asp Arg Ala Arg 
415 

His Asp Glu Phe 
430 

Asp Met Glu Asp 
445 



Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
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450 455 

• 2 1 [) > 2 b 

« 211> 13 02 

* 2 1 > Ho^.o s ap i er* s 



< 1 0 C > 2 5 

nt-gactcagc ,-r.ggt" a r rrg r.r.r grcartg 
■rLgcgtctgc cccggguguc cgacyaagag 
g-ggagatgg :ggacaacct gaggggcaag 
o'-gggcagc= ccccgcagac gctcaacatc 
gtgggtgctg cccccc^ccc cttcctgcat 
Uccgggacc tccggaaggg tgLgt_atg*g 
ctcggcaccg acctggtaag catcccccat 
octgccatca ctgaatcaga caagttcttc 
gjgctggcct atgctgagat tgccaggcct 
ctggtaaagc acacccacgt tcccnacctc 
cccotxaacc actccqaaqt cetggcctct 
gaccactcgc tgtacacagg cagtctctgg 
4aggtcatca ttgtgcgggt cgagatcaat 
tacaactatg acaagagcat tgtggacagt 
gtgtttgaag ctgcagtcaa atccatcaag 
3gtK. tczggc tacgagagca gctggtgcgc 
ttc;cag:ca tctcactcta cctaatgggt 
atc-ttccgc agcaatacct gcggccagtg 
tacaagtttg ccatctcaca gtcatccacg 
ggcttctacg ttgtctttga tcgggcccga 
:atgtgcacg atgagttcag gacggcagcg 
gaagactgtg gctacaacat tccacagaca 



rgtagrggtr rgggf.ggrgc t.^rarfgggr. fiO 
cc-gaggacc ccggccggag gggcagcttt 12 0 
-cggggcagg gctactacgt ggagatgacc lrO 
ctgg-ggata caggcaccag taactttgca 24 0 
cgctactacc agaggcagct gtccaccaca 300 
ccctacaccc agggcaagtg ggaaggggag 3 L 0 
ggccccaacg r.cactgtgcg tgccaacatt 4 2 0 
aLcaacggcL ccaactggga aggcatcctg 480 
gacgactccc tggagccttt ctntgactct 540 
ttctccctgc acctttgtgg tgctggcCLc 600 
gtcggaggga gcatgatcat tggaggtatc 5 60 
tatacaccca tccggcggga gtggtattat: 7.10 
ggacaggatc tgaaaatgga ctgcaaggag 780 
ggcascacca accticgttt gcccaagaaa 340 
qcagcctcct ccacggagaa gttccccgat 90C 
tggcaag:ag gca:cacccc ttggaacatt 9*50 
gaggtta^ca accagtcctt ccgcaicacc 1020 
gaagatgrgg ccacgtccca agacgactgt 1380 
ggcactgcta tgggagctgt tatcatggag 1140 
aaacgaattg gctthgctgt cagcgcttgc 1200 
gtggaaggcc cttttgtcac cttggacatg 126;] 
gatgagtcat ga 1302 
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<a0> 26 

* :n> 133 
-::2> ?KT 
*I-13> Homo sapiens 

*4GG> 2 6 

Met Thr Glr. His Gly lit Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly 
1 S 10 15 

Aln Pro Leu Giy Lou Arg Leu Pro Arg Glu Thr Asp Giu Glu Pro Glu 
2 0 25 3 0 

L ; lu Pro Gly Arg Arg Giy Ser ?he Val Glu Met Veil Asp Asn Leu Arg 
3b 4C 45 

Gly Lys Ser Gly Gl;i Gly Tyr Tyi Veal Glu Met Thr Val Gly Ser Pre 

50 55 60 

rrc Gin Thr Leu Asn lie Leu Val Asp Thr Giy Ser ser Asn Phe Ala 
65 70 75 8C 

Val Gly Aia Ala Pro His Pro phe Leu His Arg Tyr Tyr Gin Arg Gin 
85 90 95 

Leu Ser Ser Thr Tyr Arg Asp Lsu Arg Lys Gly Val Tyr Val Pre Tyr 
10C 10E 11C 

Thr Gin GJy Lys Trp Glu G^y Glu Leu Gly Thr Asp Leu Val Ser I^e 
115 120 125 
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Pi (j His Gly Pro 
130 

Glu ser Asp Lys 

1 45 

Gly Leu Ala Tyr 

?he Phe Asp Ser 

180 

Leu His Levi Cys 
195 

Ala Ser Val Cly 
210 

Tyr Thr Gly Ser 
225 

Glu Val lie lie 

Asp cys Lys Glu 
260 

Thr Asn Leu Arg 
275 

He Lys Ala Ala 



Asn Val Thr V<-1 
135 

Phe Phe lie Asn 
150 

Ala Glu Ilo Ala 
165 

Leu Val Lys Gin 

Gly Ala Gly Phe 

2 00 

Gly Ser Met lie 
215 

Leu Trp Tyr Thr 

2 J C 

Val Arg Val Glu 
245 

Tyr Asn Tyr Asp 

Tjp-; Pro Tys Lys 
280 

Ser Ser Thr Glu 



Aiy Ala. As ii lit; 

140 

Gly ser Asn Trp 
155 

Arg Pro Asp Asp 
17C 

Thr His Val Pro 

135 

Pro Leu Asn Gin 

Ilo Gly cly He 
220 

Pro lie Arg Arg 

2 3 5 

He Asn Gly Gin 
250 

Lyr. Ses He Val 
265 

Val Phe Glu Ala 
Lys Phe Pro Asp 



Ala Ala lit Thr 

Glu Gly lie Leu 
160 

Ser Leu Glu Pro 
175 

Asn Leu Phe Ser 
190 

Ser Glu Val Leu 
205 

Asp Hie Ser Leu 

Glu Trp Tyr Tyr 

Asp Leu Lys Mot 
255 

Asp Sor Gly Thr 
27C 

Ala val Lys Ser 
285 

Gly Phe Trp Leu 
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290 

Gly Glu Gin Leu 

305 

Ph« Fro Va ^ lie 

Phe Arg lie Thr 
340 

Val Ala Thr Ser 
355 

."er Thr Gly Thr 
370 

7a 1 Phe Asp Arc 

365 

Val His Asp 

Thr Leu Asp Met 
420 



295 

Val Cys Trp GJn 
31C 

Ser Leu 'Pyr ^eu 
325 

I In Leu Pro Gin 

Gin Asp Asp Cys 
3 SO 

Val Met- Gly Ala 
375 

Ala Arg Lys Arg 
3 90 

Glu Phe Arg Thr 
405 

Glu Asp Cys Gly 



300 

Ala Gly Thr Thr 
315 

Met Gly Glu Val 
330 

Gin Tyr Lei: Arg 
345 

Tyr Lys Phfc A. a 

Val Tip Met Glu 
380 

lie Gly Phe Ala 

395 

Ala Ala Val Glu 
410 

Tyr Asn lie Pro 
42 5 



Pro Trp Ann lie 
320 

Thr Asn Gin Ser 
335 

Pre Val Glu Asp 
3 50 

11 g Ser Gin Ser 
365 

Gly Phe Tyr Val 

Val Ser Ala Cys 
400 

Gly Pro Phe Val 
415 

Gin Thr A^p Glu 

43 0 



?10> 27 
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-■211> 127S 

oi2> d:ia 

-213> Homo sapiens 
<4C0> 27 

atggctagca tgactggtgg acay~aaatg 
C':cctggact ctggtatcga aaccgacgga 
ggcaagtcgg ggcagggcta ctacg-ggag 
acLca'.cctgy igyaLa^ayy caycaytaac 
rtgcatcgct actaceagag gcagctgtcc 
:atgLgc::t acacccaggy caagtggyaa 
•rcccatygcc ccaacgtcac tgtgcg.gcc 
'.tcttcatca acggct~caa ctgggaaggc 
aggcctgacg actccctgga gcctttcttt 
aacctcrt-^ ccctgcacct ttgtggtgct 
gcctctgtcg gayggaycat gatcactgga 
-_-tctggtata cacccatccg gcgggagtgg 
atcaatgyac agcatctgaa aatggaczgc 
gacagtggca ccacr.aacct tcgtttgccc 
atcaagqcag cctcctccac gcagaagccc 
gtgtgctggc aagcaggcac caccccttgg 
atgggtgagg ttaccaacca gtccttccgc 
-cagtggaag atgtggccac gtcccaagac 
-ccacgggca ctgttatggg agctgttiatc 
ycccgaaaac yaatcggcct tgcigtcagc 
jcagcggtgg aagycccttt tgtcacctlg 
•:agacagatg agtcatya 

<210> 28 
<211> 425 
<212> PRT 



ggtcgcggaL cgatgactat ctctqactct 60 
tcctttgtgg aga^ggtgga caacctgagg 120 
atgaccgtgg gcaqcccccc gcagacgctc ISO 
tttycayLyg yiyctgcccc ccacccctic 240 
agcacat.acr gggacctrrg gaagggtgtg 3f0 
ggggayc^yy qcaccgacct gg h „aagcatc 3C0 
aacatrgctq ccatcacrga atcagacaag 
at^ctggggc tiggcctaLgc t:gagattgcc 4KC 
gactctctgg taaigcagac ccacgttccc 54C 
ggcttccccc ncaaccagtc rgaagtcctg 6UC 
ggtatcgacc actcgc tgta cacaggcagt 660 
tattatgagg tcatcattgt gcgggtggag 720 
aaggagtaca actatrgacaa gagcattgtg 7R0 
aagaaagtgt ttgaagctgc agtcaaatcc 940 
cctgatggtt tctgqctaqq agaqcagctq 900 
aacattrtcc cagtcatctc actctaccta 9-i0 
atcaccatcc ttccgcagca atacctgcgg 102 0 
cactgttaca agtttgccat ctcacagtca 1)80 
atggagggct tctacgttgt ctrtgatcgg 1140 
gcttgccaty tgcacgatga gttcaygacg 1200 
gacatggaag actgtggcta caacattcca 126 0 

1273 
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<213> Homo sapiens 
-400> 23 

Met Ala Set Met Thi Gly Gly Gin Glu Met Gly Ary Gly Sui Met Hir 

1 5 1C IB 

lis Sar Asp Ser I'ro ~.eu Asp Ser Gly lie Glu Thr Asp Gly Ser rhc 
20 25 30 

Val CI a Met Val Asp Asr, Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr 
3S 40 45 

Val Glu Met Thr Va. Gly Se»r Pro Pre Gin Thr Leu Asr. lie Leu Val 
50 55 60 

Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe 
65 70 75 £0 

Leu His Arg Tyr Tyr Glr. Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu 

85 ?0 95 

Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu 

100 105 110 

Leu Gly Thr Asp Leu Val Ser lie Pre His Gly Pro Asn Val Thr Val 

IIS 120 125 

Arg Ala Asn He Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn 

130 135 140 

Giy Ser Asn Trp Glu Gly Tie Leu Gly L^.: Ala Tyr Ala Glu He Ala 
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1GC 



Arg Pro A^p Asp scr Leu Glu Pro Phe Pho Asp Ser Leu Val Lys Gin 
I6 r : 170 175 

Thr His Val Pro Asn Leu Phe Ser Leu His Leu Cys Gly Ala Gly Phe 
180 18= 19U 

Pro Leu Asn Gin Ser Glu Vctl Leu Ala Ser Veil Gly Gly Ser Met lie 
19=. ?.0fj 2DS 

J le Gly Gly lie Asp His ser Leu lyr Thr Gly ser Leu Trp Tyr Thr 
210 215 22C 

Pro lie Arg Arg Glu Trp Tyr Tyr Glu Val llG Va 1 Ar 9 Val Giu 

225 230 235 240 

Tie Asr, Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp 
245 25C 255 

Lys Ser lis Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys 
260 265 270 

Val Pho Glu Ala Ala Val Lys Ser 11g Lys Ala Ala Sor Ser Thr Glu 
275 280 285 

Lys Phe Pre Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin 
290 295 3C0 

Ala Gly Thr Thr Pro Trp Asn lie Phe Pre val lie Ser Leu Tyr Leu 
•05 310 315 320 
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Mer Gly Glu Val 

Gin Tyr Leu Arg 
340 

Tyr Lys Pho Ala 
355 

Val IIg Ktst: Glu 
3 7 C 

Ilo Gly Phe Ala 
385 

Ala Ala Val Glu 

Tyr Asu I1<j Pro 
4 2Pj 



Thr Asn Gin 5er 
325 

Pre Val Glu Asp 

He Ser GLn £er 

360 

Gly Pho r :Vr Val 
375 

Val Ser Ala Cys 

3S0 

Gly Fro Phe Val 
405 

Gin Thr Asp Glu 



Pne Arg He Thr 
330 

Val Ala Thr Ser 
345 

Ser Thr Gly Thr 

Val Phn Asp Arg 
38 0 

His Val His Asp 

3 95 

Thr Leu Asp Met 
410 

Ser 
425 



II© Leu Pro Gin 

335 

Gin Asp Asp "ys 
35C 

Val Met Gly Ala 
365 

Ala Arg Lys Arg 

Glu Phe Ary Thr 
40U 

Glu Asp Cys Gly 
415 



<21G> 29 
-211> 1362 
-212> DNA 

-.213 > Homo sapiens 
<4 0 0> 2 9 

Atggcccaag ccctycccty yetcctgety tyyatgygcy cyygagtgcl yccLycccdc 60 
ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggege ccccctgggg 12C 
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ctgcgjctgc 


cc-gggagac 


cgacgaagag 


cc cgaggagc 


eeggceggag 


gggcagc zt t 


18C 


gtygagatgg 


tggacaacct: 


gaggggcaag 


ccyyygcagg 


gc t_actacg~ 


ggayargacc 


240 


gtgggcagcc 


ccccgcagac 


rjc^caacarc 


ctggtggata 


caggcagcag 


taactttgea 


300 


(jLgggtgc tg 


CCCCCCdLCC 


cLLccLqcat 


CCL-LdCLct'JU 


ayagyceig- u 


gtccag-aca 


360 


* accgggacc 


t~cggaagyy 


tgtgtatgtg 


:cctacaccc 


agygcaagtg 


ggaaggggag 


420 


<: tgggcaceg 


acc tggtaag 


cat cccccat 


ggccccaacg 


f.-actgtgcg 


tgccaacait 


4S0 


gctgccatca 


ctgaatcaga 


caagtrc etc 


accaacggct 


ccaac tqgga 


agccatcc ~g 


540 


gggctggcct 


atg jcgagat 


tgrcaggect 


gacgactccc 


tggagecit t 


c t ttgactc t 


600 


: "zggtaaagc 


aga jccacgt 


tc:caacctc 


titer, ccczgc 


acctttgtgg 


tgctggczzc 


COO 


- r.cr.rraa-.r. 


agf.rrt.gaa rjt 


gctggcctct 


gtcggacgga 


gc a Lea teat: 


^ggagcjtat" 


72C 


■:acca"tcgc 


tgtacacagg 


cagtctctgg 


ta:acaccca 


reeggeggga 


gt ggtattat 


1A0 


oaggtratca 


ttgtgcgggt 


ggagatcaat 


ggacaggatc 


cgaaaa^gga 


ctgeaaggag 


840 


l acaactdty 


acaayagca: 


tgtggacagt 


ggcaccacca 


accttcgttt 


g~ccaagaaa 


9 0, n 


: tg 1 1 ccjaag 


c tcjcagtcaa 


atc:at.caag 


g^agcctcct 


c-aeggagaa 


gttccc-gat. 


9 6 C 


jg^t tctggc 


taggagagca 


gctggtylgc 


tyycaagcag 


gcaccacccc 


ntggaacat c 


102 0 


: tcccagtca 


tctcactcta 


cctaatgggt 


gaggttacca 


accag icctL 


ccg:atcacc 


108 0 


dLccLLccqc 


agcaaLacc L 


geygecag ly 


uaayaty Lyy 




ayacyac tgt 


1140 


t.scaagnttg 


ccatctcaca 


gtcatccacg 


ggcactgtta 


tgggagctgr. 


tatcatggag 


12 0 0 


ugcttctacg 


ttgtctttga 


tcgggcccga 


aaacgaattg 


gctt tgctgt 


cagegcttgs 


12 bC 


■ratgtgcacg 


dtgagctcag 


gaeggcageg 


gtggaaggcc 


ctrttgtcac 


"titggacatg 


13 2 C 


gaagactgig 


cctacaacat 


tcca^agaca 


gatgagtcat 


ga 




1362 



<210> 3 0 
• 21V> 4S3 
- 212> PF.T 

■■. 2 1 3 > Homo sapiens 
-40O 30 

Met 7*la Gin Ala Lc; Pro Trp Leu Leu Leu Trp M^r. Gly Ala Gly Val 
15 13 15 
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Leu Pre Ala His Gly Thr Gin His Gly Tie Arg Leu Pro Leu Arg Ser 

20 25 30 

Gly Leu Gly Gly Ala Pil> L«u Gly Leu Aig Leu Piu Axy Glu T::: Abp 
35 40 45 

Glu Glu Pre Glu Glu Pro Gly Arg Arg Gly Her Phe Val Glu Me- Val 
SO 55 60 

Asp Asn Lm; Arg 'Gly Lys Ser Gly Gin Gly l s yr Tyr Val Glu N:ar. "hr 
65 70 75 80 

Val Gly Ser Pre Pre Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 

S5 SO 9 j 

ier Asn Phe Ala Val Gly Ala Ala Pre His Pro Pho Lou His Arg Tyr 
10C 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
lib 120 125 

Tyr V=il Pre Tyr Thr Gin Gly Lys Tip Glu Gly Glu L=;u Gly Thr Asp 

13 0 135 14 0 

Leu Val Ser He Frc His Gly Pre Asn Val Thr Val Arg Ala Asn He 
'.45 150 155 lfiO 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
16a 17 0 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu Tie Ala Arg Pro Asp Asp 
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183 135 190 

::cr Lou Clu Pro Pho Phe Acp 3--r Lju VH Lyi; Gin Thr Hit; Val Pro 
195 2C0 205 



Asn Leu ?he Ser Leu Gin Leu Cys Gly Ala Gly ?he Pro L-^u Asn Gin 
^Hi 21b 22C 



■ex Glu Val Leu Ala Ser Val Gly Gly S<-r Met. He lie Gly Gly He 

2il) 23 5 24 0 



Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arc Arg 

24 5 25 0 25 5 

Glu Trp Tyr Tyr Glu Val Ho He Val Arg Val Glu lie Asn Gly Gin 

260 265 270 



Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 

275 26 0 285 

Asi3 Ssr Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 



Ala Val Lye Scr He Lys A j a Ala Scr sar Thr Glu Lyc Pha Pro Asp 
-C5 3 1 C 3 15 320 

"Hy Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala. Gly Thr Thr 

325 33C 335 

Pre Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 

340 " 345 350 
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Thr Asn Glr. Ser Pne Arg lie Thr lie Leu Pro Gin Glr. Tyr -e;; Arg 
355 360 365 

Pro Val CI- Asn Val Ala Thr Ser Gin Asr: Asp Cy? Tyr lys Phe Ala 
370 375 38C 

:1g sor Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val He Met Glu 

;<3 5 3 90 3 95 4C0 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Ly^ Arg He Gly Phe Ala 

4 OS 41 (J 4 IS 

^■ r al Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala VaL Glu 
420 425 430 

Gly Pro Phe Val Thr Lgu Asp M2_ Glu Asp Cys Gly Tyr Asn He Pro 
4 3 5 440 445 

Gin Thr Asp Glu ser 
450 

<21G> 31 

-211> 1380 

- 2 I 2 > DNA 

^213> Homo sapiens 

•-400> 31 

izggcccaag ccctgccctg gctcctgctg tggatgggcg egggagtget gcctgcccac 60 
ggcacccagc acggcatccg gctgcccctg cgcagcggcc tggggggege ccccctgggg 120 
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ctgcggc^tj:.' cc^gggagac cgacgauyug ccngaggagc ecgcccgguy gagcag^ttt 130 
gtggagar.gg tggacaacc~ gaggggcaag tcgggccagg gctactacgr. ggagatgacc 240 
gtgggcagcc ccccgcagac g:tcdacatc etggtcgata caggcagcag taactttgca 300 
ytgggtgctg c,czc~c<ac.cc ct:cctgcat cgctactacc agaggcagct gtccagcaca 3 CO 
laccggga-c tccggaaggg tg-gtatgtg ccctacaccc agggcaagtg ggaaggggag 420 
ctgggcaccg acctggtaag ca:cccccat ggccccaacg tcactgtgcg tgccaaca^t 480 
gctg^cati-a ctgaatcaga caagrtcttc aicaacggct: ccaactggga aggcatccrg 54 0 
cyyccggcjt atg~-gacat ::;ccag;:cct yacyactccc "ggaccc tt; I c^t.r.gactct 60C 
o tggtaaagc agazccacgt tcccaacctc t::utccctgc acctttgtgg tgctggcCt- 660 
ccccccaa-o agtr-gaagt g^tggcctct gccggacgga gcatgatca^ tggaggtatr: 720 
eacc-Aczcjc tgtacacagg cagtctctgy tatacaccca Lccyycygga gtggtaLtat 780 
caygccatoa Ltytycyygt yyayalcaat ggacaggatc tyaaaatyyri ctgcaaggag 840 
tacaa::a:g acaagagcat tgtggacagt ggcaccacca accL.cgttt gcccaagaan 900 
gtg^ttgaag ccgcagtcaa atccatcang gcagcctcct ccacggagaa gttccctgat 960 
cgttCctggc taggagagca gctggtgtgc lygcaagcag gcaccacccc ttgcaacatt 102C 
•■rrccagtca tctcactcta cctaatgggt garjgttacca aecagtcctt ccgcatcacc 1080 
atcctl-ccgc agcaatacct gcggccagtg gaagatgtgg c:acgtccca agacgactgt 1140 
racaagtttg ccatctcaca gtcatccacg ggcactgtta tgggagctgt tatcatggag 12 0U 
jgcttctacg ttgtcrtrga tcgggcccga aaacyaaltg gctttgctgt cagcgcttgc 1260 
ra^gtycacy aLuag-tuag yacyycaycy ytyyaayycc cttttglcetc eLLyyctcaly 1320 
gaaga-Lgtg gc:ar.aacat rccacagaca gatgagLcac agcagcagca gcagcagtga "3*0 

-.;ic> 3 2 

•:;il> 459 
•:Z12> ?RT 

<-21J> Homo sapiens 
*:4CG> 32 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 lb 
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Lou Pro Ala His Gly Thr CLn His Cly lie Arg Lou Pro Leu Arg Sor 
20 25 30 

Gly Leu Gly Sly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
3 5 4 'J 4 5 

Glu Glu Piu Gl^ Glu Pre Gly Ary Aiy Gly Sei Phe Veil Glu MeL Val 

50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr val Glu Met Thr 

65 7C 75 80 

VaL Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 9a 95 

::er Asn Phe Ala Val Gly Ala Ala Prn His Pro Phe Leu His Arc Tyr 
100 105 110 

Tyr Sin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

:yr Val Pre Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser Tie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
14b "J S 0 "^55 160 

Ala Ala He Thr Glu Ser Asp Lys Phe ?he lie Asn Gly Ser Asr: Trp 
165 170 175 

Glu Gly He Levi Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
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180 



185 



190 



scr Leu Glu Pre Phe Phe Asp Scr Leu Val Lys Gin Thr His Va. Pro 
195 2 00 205 



Asn Leu Phe Ser Leu Gin Leu cys 
210 21b 

Ser Glu Val Leu Ala Ser Val Gly 

.-.sp ser „eu Tyr Thr ;_-ly Ser 

24 5 

Glu Trp Tyr Tyr Glu Vai lie lie 
26C 

Asp Leu Lys Mot Asp Cys Lys Glu 

275 283 



Gly Ala Gly Phe Pro Leu Asn Gin 

22 0 

Gly Syr Met lie He Gly Gly lie 

Leu Trp Tyr Thr Pro lie Arc; Arq 
2 5 0 2 E 5 

Val Ary Val Glu Uo Asn Gly Gin 
265 270 

Tyr Asn Tyr Asp Lys Ser Tie Val 
2QS 



Asp Ser Gly Thr Thr Asn Leu Arg Leu Pre Lys Lys Val Phe Glu Ala 
290 295 300 

Ala Val Lyn Ser lie Lys Ala Ala Scr Scr Thr Glu Lye Phe Pro Asp 
305 310 315 320 

Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 

I to Trp Asn lie Phe Pro Val lie ser Leu Tyr Leu Met Gly Glu Vai 
340 " 345 350 
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'hr Asn Gin Ser 
355 

Pre Val Glu Asp 
37C 

lie Sgl" Gin Ser 

Jly Phe 17 r Val 

-Ml ser Ala Cys 
420 

Jly Pro Phe Val 
43 = 

Jin Thr Asp Glu 
450 



?hA Arrj Hp ^hr 

360 

Vai Ala Thr Ser 

375 

Ser ?hr Gly ihr 

39C 

Val Pht; Asp Arc; 
40 5 

His Val His Asp 

Thr Leu Asp Met 
440 

Ser His His His 

455 



7 1 g Lou Pro Gin 

Gin Asp Asp Gys 

33 0 

Val Met Gly Ala 

3S5 

Ala Arc Lys Arg 
4 1 0 

Glu Phe Arg Thr 
425 

Glu Asp Cys Gly 
His His His 



Gin Tyr Leu Arg 
365 

Tyr Lys Phe Ala 

Val lie Met Glu 

4C0 

I_e Gly Phe Ala 

4i 5 

Ala Ala Val Glu 
430 

Tvr Asn lie Pro 
445 



<21C> 33 

< 211> 25 

• 212> PRT 

v 2 1 3 > Homo s ap i en s 

<400> 33 

Ser Giu Gin Gin Arg Arg Pro Arg Asp Pro Glu Val VaL Asn Asp Giu 
15 10 15 
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Sor Ser Lgu Val Arg His Arg Trp Lys 
20 25 

<210> 34 
<2ll> IV 
-.212> PRT 

<213> Homo sapiens 
<^0 0> 3 4 

ser Glu (; in Leu Arg G.r. G^n His Asp Asp Phe Ala Asp Asp He Ser 
15 10 15 

Lou Leu Lys 



<210> 3 5 
<21l> 2 r J 
<212> DMA 

••21 j > Home sapiens 
<40Q> 35 

gtggatccac ccagcacggc ateeggctg 29 

<2'.0> 3b 
<211> 36 
■:212> DUh 

<2I3> Homo sapiens 
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<4CG> 3 6 

gaaag~tLtc atgacLcatc tgLctytgya at^Lzg 
-:210> 37 
<212> DNA 

* .2~j> H:;mo sapiens 



•-4(>G> 3 / 

gar.cgaf.gacr ratrrcr.gac t.ritccgcgtg aacaggacg 
■::iC> 3 8 

-:n> 3.9 

-212> DNA 

•■21J> Homo sapiens 



••4 0O> 3 3 

jatccgtr:* gttnacgcgg agagtcagag atagtcatc 



■■ ::u> 3 9 

•-211> 77 
*212> DNA 

•-21 J > Artificial Sequence 



••220> 

. 223> Description of Artificial Sequence: Ku-Asp2 



•:-10U> 39 

-ggcatccgg ctgcccctgc gtagcggtct gggtggtgct ccactgggtc tgcgzcrgcc 60 
ccgggagacc gaccaag 77 
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<-21C> 4: 

<.:n> 77 
* :. 1 2 > dna 

<. 213> Artificial Sequence 
* :20:- 

^223> Description cf Artificial Sequence: Hu-Asp2 
<40C> 40 

ct tegtcygt ctcccggggc agaegcagae ccagtggagc acea^ccaga regrtacgea fO 
ggggcagccg gatcccg 77 

■ . 1 0 > 41 

*-:u> si 

<?A2> DMA 

<j13> Artificial Sequence 
•,2 2 0> 

vj23> Description ct Artificial Sequence: Caspase 8 
Cleavage Site 

^40C> 41 

ghtcgatgac tatctctyac tctccgcugg actctggtar cgaaaccgac g SI 
- .M.0> 4 2 
* 2 1 2 > DKA 

<.113> Art^icial Spqup-.ne 
-.220> 

--223> Description of Artificial Sequence: Caspaso 8 
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C 1 eavaye Site 
«.4 0 0> 4 2 

uatccgtcqq ttLcuatacc aqd^iccaac ggagagtcag agatactcat c 



* 2"lC> 45 
• :il> 32 
-.:i2> DNA 

<I13> HoitlC sapiens 
-;4:30> 4? 

adggatcctt tgtggagatg qtggacaacc tg 

-;210> 44 

<::u> 3 6 

•-212> d:;a 

<213> Homo sapiens 



32 



<40o> 44 

gaaagctttc atgactcatc tgtctgngga atgtcg 

•-210> 45 
<::11> 74 

■.:i2> d:ja 

-:213> Artiiicial Sequence 
<O20> 

< ;> ^ > Description of Artificial Sfiqupnr*: 6-Kis rag 
<400> 4b 

gategcatea tcaccatcac catg 
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•.: ?. : n > 4 h 

24 

*,212> UNA 

-:213- Artificial Sequence 

<223> Inscription cf Artificial Sequence: 6 -His tag 
^0C> 4 6 

Lijatceatggt qatggtyatg atgc 

-210> 47 

v;;i;.. 354 

* 112 - DKA 

<213> Artificial Sequence 
c220> 

--223- Description of Artificial Sequence: introduce KK 
motif 

<—0 0> 47 

hntt ianvtt rinrmngactg accacrcgac 

ayrsksOsna yrxawsddcg tmsnwrmans 

■rsnayrsnay rhDdtgactg accactcgac 

csnayrtcna morstwrdOt dthharmaca 

nCbdOcdaOO aOcaOrtntr ygtafcwrcdc 

tnsksOycmb abc-rhvgrr ccrCrsracrs 

<.!1U> 48 
<211> 462 

83 



caggr.tcbnr macmhadata ragrahntsr. 6fi 

ymbarahrOg actgaccact cgaccaggtt 12 0 

caggttcact snayrctcsn asnanrmadt 180 

hngactgacc actcgaccag gttcttdgria 24 0 

mntsm-naryn rrnatndcmnt sirjnarynrma 3CC 

twrddcrcritm swrddcwrdd cmnt 3 54 
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* ?A2> Art. i ' nisi Scquenrc 

< 2 2 0 > 

< 223> Description of Artificial Sequence: Introduce KK 

not i r" 



^4O0> 48 














bbctaanttr. 


nnnkncgaat 


taaattccag 


cacactggc t 


acttc ttgtt 


ctgcatc tea 


60 


aagaacbrLrm 


acmhadatar 


agrahntsna 


yrsksOsnay 


rtawsddcgt 


n\snwrmanEy 


120 


mbarahrOcg 


aat taaattc 


caccacactg 


gctacttct t 


gttctgcatc 


tcaaagaacs 


13 3 


r.ayrsnayrh 


Chtcgaatta 


aattccagca 


cactggrtac 


ttrttgttct 


gcatctcaaa 


7A J 


oaacgaasna 


yrttcsnas:i 


anrmadtcsn 


ayrtcr.amcr 


s twrdDcgks 


kdhharrciaca 


303 


;.ncyaa t Laa 


at tccacjcac 


ac tggctact 


tc ttgttctg 


catcLcaaay 


aac L Ldydan 


360 


rbOcdaOOaO 


caOrtntryh 


kktafcwrddc 


nmtsrrmaryn 


nrat ndcmnt 


smmarynrma 


420 


tntdccmfcbc 


tckkr.crstw 


rddcrrntmsw 


rddcwrddcm 


nt 




462 



<210> 49 
*:211> 380 
<J12> CNA 

-■213> Artificial Sequence 
<220> 

*>223> Description of Artificial Sequence: Introduce KK 
rr.otif 

<400> 49 

hbrtaar.ttn nnnnncgaat taaattccag cacactggct afcnrniasmha dataragrah 60 

ntsnayrsks Osnayrtaws ddcgtmsnwr mansymbara hrCcgaatta aattccagca 120 

cactggctas nayrsnayrh Udhcgaatta aattccagca cactggctag aasr.ayrttc ISO 

snasnanrraa dtcsnayrtc namcrstwrd Ocmdhbarma cahncgaa_t aaattccagc 240 
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aca:Lggc"a t. t.dgdanQbO cdaU Ca CcaO 
tndcnntsmrn arynmatns ks Cycr^bnr.Tic 
dcrrLnr.mPvvrd dcwrddcmnt 



r tnt rymionT: abwrddcmnt sirirarynrma 303 
rbanbctkmk TigdgOgnrrfi rsmrr^twrd 3 60 

380 
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Claims Nos.: claims 32, 33, 35, 36, 70, 71, 76, 78 and 79 and partiall 
claims 1, 18, 28, 44, 61. 72 and 141 



Present claims 1, 18, 28, 44, 61, 72 and 141 relate to an extremely large 
number of possible products. In fact, the claims encompass so many 
possible compounds that a lack of clarity (and/or conciseness) within the 
meaning of Article 6 PCT arises to such an extent as to render a 
meaningful search of the claims impossible. 

Moreover, in view of the large number and also the wording of the claims 
presently on file, which renders it difficult, if not impossible, to 
determine the matter for which protection is sought, the present 
application fails to comply with the clarity and conciseness requirements 
of Article 6 PCT (see also Rule 6.1(a) PCT) to such an extent that a 
meaningful search is impossible. 

In addition, the obscure definition of claims 32, 33, 35, 36, 7G, 71, 76, 
78 and 79, relating to an unidentified SEQ ID. and referring to the 
examples renders as well the search of these claims impracticable. 

Consequently, the search has been carried out for those parts of the 
application which do appear to be clear, namely the particular sequences 
SEQ ID NCs.: 1, 2, 3, 4, 5, 6, and 8, variants, and uses thereof 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-31, 34, 37-69, 72-75, 77, 80-129, 
136-140 and partially 141 



Proteases capable of cleaving the beta secretase cleavage 
site of APP, variants thereof; polynucleotides encoding 
them; vectors and host cells comprising the same; antibodies 
for the polypeptides and uses of the foregoing in screening 
tests. 



2. Claims: 130-135 and partially 141 

APP isoform wherein the last two carboxy terminus amino 
acids are Lysine residues. 
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